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Preface 



The present thesis contains an exposition of interpolation theory in harmonic 
analysis, focusing on the complex method of interpolation. Broadly speaking, 
an interpolation theorem allows us to guess the "intermediate" estimates 
between two closely-related inequalities. To give an elementary example, we 
take a square-integrable function / on the real line. It is a standard result 
from real analysis that / satisfies the L 2 -Hdlder inequality 

f roo \ 1/2 / poo \ 1/2 

\f(x)g(x)\ dx < ij \f(x)\ 2 dx\ (J \g(x)\ 2 dx 

for every integrable function g on the real line with compact support. If, in 
addition, / satisfies the integral inequality 

(poo \ 1/2 / poo \ 

/ \f(x)\ 2 dx) / \g(x)\dx) (2) 
J —CO J \J — CO J 

for all such g, it then follows "from interpolation" that the inequality 

(poo \ 1/2 / poo \ i/p 

J \f(x)\ 2 dx\ (j \g(x)\*dx) (3) 

holds for all 1 < p < 2 and all g on the real line with compact support. 

From a more abstract viewpoint, we can consider interpolation as a tool 
that establishes the continuity of "in-between" operators from the continuity 
of two endpoint operators. The above example can be viewed as a study of 
the "multiplication by /" operator 

(Tg)(x) = f(x)g(x). 

In the language of Lebesgue spaces, inequality Q implies that T is a contin- 
uous mapping from the function space L 2 to itself, and inequality ^ implies 
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that T is a continuous mapping from L? to another function space L 1 . The 
conclusion, then, is that T maps L 2 continuously into the "interpolation 
spaces" W (1 < p < 2) as well. 

Presented herein are a study of four interpolation theorems, the requisite 
background material, and a few applications. The materials introduced in 
the first three sections of Chapter 1 are used to motivate and prove the 
Riesz-Thorin interpolation theorem and its extension by Stein, both of which 
are presented in the fourth section. Chapter 2 revolves around Calderon's 
complex method of interpolation and the interpolation theorem of Fefferman 
and Stein, with the material in between providing the necessary examples 
and tools. The two theorems are then applied to a brief study of linear 
partial differential equations, Sobolev spaces, and Fourier integral operators, 
presented in the last section of the second chapter. 

I have approached the project mainly as an exercise in expository writing. 
As such, I have tried to keep a real audience in mind throughout. Specifi- 
cally, my aim was to make the present thesis accessible to Rutgers graduate 
students who have taken Math 501, 502, and 503. This means that I have 
assumed familiarity with the standard material in advanced calculus, com- 
plex analysis, linear algebra and point-set topology. In addition, I expect 
the reader to be conversant in the language of measure and integration the- 
ory including Lebesgue spaces (L p spaces), and of functional analysis up to 
basic Banach and Hilbert space theory. Beyond those, the required tools 
from functional analysis are summarized in the beginning of Chapter 2, and 
elements of harmonic analysis are introduced throughout the thesis. 

Before I realized how much time it would take to develop each topic 
at hand, I had planned to include some harmonic function theory, maxi- 
mal function theory of Hardy and Littlewood, the interpolation theorem of 
Marcinkiewicz, the standard material on the theory of singular integral oper- 
ators, and the Lions-Peetre method of real interpolation as a generalization 
of Maricnkiewicz. This never happened, and what I had in mind is reduced 
to a brief exposition in the further-results section of Chapter 2. Of course, 
given the length of the present thesis as is, I simply would not have had the 
time and energy to give the extra materials the care they deserve. 

Nevertheless, the inclusion of the theory of singular integral operators 
would have helped motivating the section on Fefferman-Stein theory in Chap- 
ter 2, which I believe is extremely condensed and, frankly, dry as it stands 
now. Moreover, I was not able to come up with a coherent narrative for the 
section on the functional-analytic prerequisites in the beginning of Chapter 2. 



Is there any way to make a "random collection of things you should probably 
know before reading" section flow pleasantly smooth without expanding it 
into a whole chapter or a book? I do not have a good answer at the present 
moment. 

But, enough excuses. I had a lot of fun writing this thesis, and I hope 
that I managed to produce an enjoyable read. Please feel free to send any 
comments or corrections to markhkim@dimax . rutgers . edu. 
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Chapter 1 

The Classical Theory of 
Interpolation 



In the first chapter, we study two interpolation theorems, both of which are 



presented in §1.4| Interpolation theory began with a 1927 theorem of Marcel 
Riesz, first published in |Rie27bj . Riesz convexity theorem, as it is called, 
did not arise as a theorem of harmonic analysis, as the paper dealt with the 
theory of bilinear forms. It was Riesz's student G. Olof Thorin with his thesis 
|Tho48] who appropriately generalized the theorem of Riesz and placed it in 
its proper context. The complex- analytic method used in the proof of the 
Riesz-Thorin interpolation theorem was then generalized by Elias M. Stein, 
allowing for interpolation of families of operators. This result, known as the 
Stein interpolation theorem, was included in his 1955 doctoral dissertation 
and was subsequently published in [Ste56j. 

The first two sections of the chapters are devoted to developing the nec- 
essary tools for stating and proving the interpolation theorems. We review 
the theory of measure and integration in the first section, which is included 
mainly as a convenient reference. In the second section, we tackle approx- 
imation theorems in Lebesgue spaces, which provide a convenient way of 
studying function spaces by focusing on small samples of functions. We then 
switch gears and present the basic theory of Fourier transform in the third 
section. This serves primarily to motivate the Riesz-Thorin interpolation 
theorem and to provide a useful example to which the theorem can be ap- 
plied. The chapter culminates in the fourth and the last section, in which 
we state and prove the Riesz-Thorin interpolation and its generalization by 
Stein. 
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2 CHAPTER 1. THE CLASSICAL THEORY OF INTERPOLATION 

1.1 Elements of Integration Theory 

We begin the chapter by collecting the necessary facts from measure and 
integration theory. The present section is meant to serve only as a quick 
reference, and so the details will necessarily be sparse. See |SSllj . [EB05J, 
|Fol99] . |Rud86 j . or any other standard textbook on the subject for a more 
detailed treatment. 

1.1.1 Measures and Integration 

Recall that a a-algebra on a nonempty set X is a collection Wl of subsets of 
X such that 

(a) G Wl and X e Wl. 

(b) If {E n )™ =l is a sequence in Wl, then \J n E n G Wl. 

(c) If E G 971, then X \ E G Wl. 
Note that (b) and (c) imply 

(d) If 1 is a sequence in Wl, then p| n E n G 971. 

The pair (X, Wl) is referred to as a measurable space. Given a measurable 
space (X, 9Jt), we say that a subset of X is measurable if it is an element of 
Wl. A measure on (X, 9JT) is a function /i : Wl — > [0, oo] that is countably 
additive, viz., 



for every pairwise disjoint sequence (E n )™ =1 of measurable sets. Every mea- 
sure \i on X satisfies the following properties: 

(a) fi(0) = 0; 

(b) Monotonicity. If E and F are measurable subsets of X and if E C F, 
then //(£?) < 

(c) Countable subadditivity. If (-E' n )^L 1 is a sequence of measurable sub- 
sets of X, then 
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(d) Continuity from below. If E ± C E 2 C E 3 C • • • is a sequence of 
measurable subsets of X, then 



(e) Continuity from above. If E\ D E 2 2 E 3 ■ ■ ■ is a sequence of measur- 
able subsets of X such that fx(Ei) < oo, then 



Given a nonempty set X, a a- algebra 97t on X, and a measure /x on the 
measurable space (X, 971), we refer to triple (X, 971, //) as a measure space. 
A measure space (X, 97t, //) is said to be /irwte if /i(X) < oo, a -finite if 
there exists a sequence (i? n )^ 1 of finite-measure sets whose union is X, 
and complete if all subsets of measure-zero sets are measurable. We often 
talk about a finite measure, a a-finite measure, or a complete measure: this 
usage introduces no ambiguity, as specifying a measure picks out a unique 
a-algebra as its domain, and this a-algebra, in turn, determines a unique 
base set. Similarly, we usually speak of measures on the base set X, even 
though the measures are, strictly speaking, defined on measurable spaces. 

If X is a topological space, then we define the Borel a-algebra to be the 
smallest a-algebra on X containing all open subsets of X. A Borel set in X is 
an element of the Borel a-algebra on X, and a Borel measure on X is a mea- 
sure on X that renders all Borel sets measurable. The canonical measure on 
R rf , the d- dimensional Lebesgue measure, is the unique complete translation- 
invariant Borel measure ££ d on R d with the normalization J2? d ([0, l] d ) — 1. If 
there is no danger of confusion, m(E) or \E\ is often used in place of J£ d (E). 
We shall have more to say about the Lebesgue measure later in this section. 
For now, we merely remark that the Lebesgue measure is a-finite. 

Given a measure space (X, 97T, /i) and a topological space Y , we say that 
a function / : X — > Y is measurable if each open set E in Y has a measurable 
preimage f~ 1 (E). If X is a topological space and /x a Borel measure, then the 
definition renders all continuous functions measurable. If Y is R or C, then 
the sums and products of measurable functions are measurable. We observe 
that the supremum, the infimum, the limit superior, and the limit inferior of a 
sequence of measurable functions is measurable. This, in particular, implies 
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that the limit of a pointwise convergent sequence of measurable functions 
is measurable. In fact, if the set of divergence is of measure zero, then this 
continues to hold. In other worlds, the limit of a pointwise almost- everywhere 
convergent sequence of measurable functions is measurable. We say that a 
property P holds almost everywhere if the set on which P does not hold is 
of measure zero. 

Let (X, 971, ji) be a measure space. The characteristic function, or the 
indicator function, of E C X is defined to be 



1 if x E E; 
if x G X \ E. 



Xe(x) 

A simple function s on X is a finite linear combination 



N 
n=l 



of characteristic functions, where each A n is a complex number and E n a 
measurable set. Note that simple functions are automatically measurable. 
The (Lebesgue) integral is defined to be the sum 

/ sdfi= / s(x) d/i(x) = V* \ n n(E n ). 

J Jx n=l 

We extend the definition of the integral to nonnegative measurable func- 
tions / on X by setting 

j f d/i — J f(x) d/i(x) = sup |y s d/i : s is simple and < s < / j- 

and call / integrable if the integral is finite. With this definition, we can state 
one of the fundamental theorems in measure theory, the monotone conver- 
gence theorem: every increasing sequence (f n )^=i of nonnegative integrable 
functions on X converging pointwise almost everywhere to a function / on 
X satisfies the identity 

lim / f n d\i= \ lim f n d\i= \ f d\i. 



n— >oo 



The theorem allows us to approximate the integral of nonnegative measur- 
able functions by integrals of simple functions. Indeed, every nonnegative 
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measurable function f on X admits an increasing sequence (s n )^ =1 of non- 
negative simple functions that converge pointwise to / and uniformly to / 
on all subsets of X on which / is bounded. For non-increasing sequences of 
functions, we have Fatou's lemma, which states that every sequence (f n )^ = i 
of nonnegative measurable functions on X satisfies the inequality 

lim inf /„ dfj, < lim inf / f n d\x. 

Before we extend the definition of the integral to general cases, we take a 
moment to tackle a minor technical issue. Functions like f(x) = x~ 1//2 X[o,ii 
are "integrable over R" and have finite integrals, but they are not functions 
on R in the traditional sense, for x = must be excluded from the domain. 
In order to incorporate such functions into the framework of Lebesgue inte- 
gration, we ought to turn them into measurable functions on their natural 
"domain space" . The solution is to consider the extended number system R, 
which consists of the real numbers, the negative infinity — oo, and the posi- 
tive infinity oo. We define the arithmetic operations on R by inheriting the 
operations from R and then by setting 

x 

x ± oo = ±oo, - — = 0, y ■ (±oo) = ±oo, (—y) • (±oo) = =poo 
±oo 

for all x G R and y G R \ {0}; we do not attempt to define oo — oo. In 
measure theory, we typically set 

• ±oo = 0, 

so that the values of an extended real-valued function on a set of measure 
zero are negligible. We say that a function / : X — Y R is measurable if 
oo,a)) is measurable in X for each o£l. With the standard topol- 
ogy on R, this definition agrees with the standard definition of measurable 
functions given above: see § §1.5.1 for a discussion. 



We now fix an arbitrary measurable extended real-valued function / on 
X and define 

f + (x) = max{/(x), 0} and f (x) = max{— f(x), 0}. 

/ + and /~ are nonnegative, measurable, extended real-valued functions, and 
so we can define the integrals j f + and J f~ by a simple modification of the 
definition of integral for nonnegative real-valued functions. Since / can be 
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written as the difference f + — f , it is natural to define the integral of / to 
be 

J fdfi = J f(x)dfi(x) = J f + dfi- J f'd/i, 

provided that the difference is well-defined. We say that / is integrable if 
and only if the integral of / is finite. 

If / is complex-valued, we use the decomposition 

/ = Re/ + - Re/" + i(lmf + - Imf) 

to define the integral of / to be 



L 



f(x)d/i(x) — J Ref + dfi — J Re f d/i+i (^J Im / + dji — J Im/ d/i^j , 



ix 
where 

Ref + (x) = max{Re/(a;),0}; 

Ref~(x) = max{— Re f(x), 0}; 

Im/ + (a;) = max{Im/(x), 0}; 

lmf~(x) = max{— Im f(x), 0}. 

Again, the integral of / is defined only when the above sum of integrals is 
well-defined, and we say that / is integrable if the integral of / is finite. The 
main convergence theorem for this definition is the dominated convergence 
theorem: a sequence (f n )^=i of measurable functions converging pointwise 
almost everywhere to / and satisfying the bound \f n \ < g almost everywhere 
with an integrable function g satisfies the following identity: 



lim / f n dfi= / lim f n d^= / / dfi. 

n— >oo J J n->oo J 

Instrumental in proving the aforementioned convergence theorems are the 
following basic properties of the integral: 

( a ) /(/ + g)dfi = J fdfi + J g dfi. 

(b) f (A/) d/j, = A J f d\x for each complex number A. 

(c) If / < g, then J f dfi < J g dfi. 

(d) \ ffdiM\<f\f\diM. 



1.1. ELEMENTS OF INTEGRATION THEORY 



7 



(e) If / = almost everywhere, then f f\E d\i = for all E. 

(f ) If n(E) = 0, then / f X E dfi = for all /. 

(a) and (b) imply that the integral is a linear functional on the Lebesgue space 
L p (X,fi), which we shall define in due course, (e) and (f) can be rephrased 
in terms of integrating over subsets: if / is a complex-valued measurable 
function on X and E a measurable subset of X, then the integral of f over 
E is 

/ fdfi= / fxE dfi. 

J E JX 

(d) implies that the integrability of |/| establishes the integrability of /. In 
fact, a simple computation shows that the converse is true as well. 



1.1.2 LP Spaces 

In light of the above observation, we see that the collection L l (X,fi) of 
complex- valued measurable functions / on X such that J* ]/| d/j, < oo collects 
all integrable complex- valued functions on X. We thus define the L l -norm 
H/lli of / e L 1 (X, yu) to be the integral f \f\ dji. Note that the L 1 -norm is 
not a norm as it is, since functions that are zero almost everywhere still have 
the L 1 -norm of zero. To rectify this issue, we consider L X (X, fx) to be the 
quotient vector space defined by the equivalence relation 

/ ~ g f — g almost everywhere, 

at which point the L 1 norm becomes a bona fide norm on L 1 (X, //). 

We pause to make two remarks. Note first that every integrable function 
must be finite almost everywhere, whence each extended real- valued inte- 
grable function is equal almost-everywhere to a complex-valued integrable 
function. Therefore, extended real-valued integrable functions can be put in 
L 1 (X, [/,) without disrupting the complex- vector-space structure thereof. We 
also point out that the equivalence-class definition provides no real benefit 
beyond resolving a few technical issues. Therefore, we shall be intentionally 
sloppy and speak of functions in /i), unless structural nit-picking is 

necessary. 

Endowing L 1 (X, //) with the corresponding norm topology, we can now 
consider the dominated convergence theorem as a sufficient condition for 
turning pointwise almost-everywhere convergence of integrable functions into 
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convergence in the L 1 -norm. We also have a partial converse, which states 
that every sequence of integrable functions converging in the L 1 -norm admits 
a subsequence, with a dominating function in L l , that converges pointwise 
almost everywhere. We note that the L 1 -metric 



is complete, so that L 1 (X,fi) is a Banach space, a normed linear space whose 
norm-induced metric topology is complete. 

It is also useful to consider the space L 2 (X,fi) of square-integrable func- 
tions on X, with the quotient-space construction as above to avoid technical 
problems. The bilinear form 



is an inner product on L 2 (X, /i), which is well-defined by the Cauchy-Schwarz 
inequality: 



which furnishes a complete metric. Therefore, L 2 (X, fi) is a Hilbert space, an 
inner product space whose norm-induced metric topology is complete. Even 
better, if we set X to be the Euclidean space M. d and \x the <i-dimensional 
Lebesgue measure, then the corresponding L 2 -space is separable, viz., it con- 
tains a countable dense subset. Since all separable Hilbert spaces are unitarily 
isomorphic to one another, L 2 is, in a sense, the Hilbert space. 

Recall that a linear functional on a real or complex vector space V is 
a linear transformation on V into the scalar fielcfj] F, which is taken to be 
either K or C. If V is a normed linear space, a linear functional I on V is 
bounded in case it admits a constant k such that 



d L i(f,g) = 11/ -p||i 






1/2 



\lv\ < k\\v\\v 




1 Since we primarily work over the complex field C in the present thesis, we will not 
retain this level of generality for the rest of the thesis. One exception occurs in §2.1[ where 
we consider real vector spaces and complex vector spaces separately. 
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for all v & V. We note that I is bounded if and only if I is continuous with 
respect to the norm topology of V. The collection V* of bounded linear 
functionals on V forms a vector space, called the dual space of V. It is a 
standard result in real analysis that V* is a Banach space with the operator 
norm 

\\l\\v = sup \lv\, 
\M\<1 



which, in turn, is the infimum of all possible k in (1.1). 

Since many transformations of functions that arise in mathematical anal- 
ysis can be understood as bounded linear functionals on function spaces, it is 
of interest to describe them as concretely as possible. A common approach, 
known as a representation theorem, is to determine the obvious bounded lin- 
ear functionals on the given function space, and then to investigate the extent 
in which arbitrary bounded linear functionals can be represented by the ob- 
vious ones. For L 2 , we have a wonderfully concrete representation theorem, 
due to Frigyes Riesz: 

Theorem 1.1 (F. Riesz representation theorem, Hilbert-space version). If 

% is a Hilbert space, then each bounded linear functional I : H — > C admits 
a unique element u G H such that 

lv = (v,u) n 
for all v G %. Moreover, \\l\\u* = \\ u \\u- 

It follows that we can identify each element of W* with an element of H. In 
particular, we conclude that 

(L\X,n))* = L\X,n) 

in light of the above identification. 

Having considered L 1 and L 2 , we now define, for each p e [l,oo), the 
Lebesgue space L p (X,fi) of order p on X by collecting the complex- valued 
measurable functions / on X such that 

i/p 



J x \f\ P d^j P <oo. 



The standard quotient construction is applied here as well, turning || • || p into 
a norm. With the language of Lebesgue spaces, Holder's inequality can be 
stated succinctly as 

Wfgh < WfWU 
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where p > 1 and p' is the conjugate exponent 

P 



P 



p — 1 



of p. Note that 1/p + 1/p' = 1. 

Note that if / E L 1 (X,fi) and g is bounded, then 

ll/0l|i< ||/||isup|(/(x)|. 

Expanding on this idea, we introduce the space L°°(X,fi) of complex- valued 
measurable functions f on X whose essential supremum 

imU = inf{AGM:M{x:|/(x)|>A}) = 0} 

is finite. The space L°°(X, //) can be considered as a "limiting space" of 
L p (X,ii), for if / € L°° is supported on a set of finite measure, then f E L p 
for all p < oo and 

lim ||/L=||/iU- 

p— >-oo 

We remark that Holder's inequality holds for p = 1 as well, with the identi- 
fication l/oo = to yield p' = oo. 

Given p e [1, oo], Minkowski's inequality establishes the triangle inequal- 
ity for || ■ ||p, thus turning || • \\ p into a norm on L p (X,fi). Moreover, the 
Riesz-Fischer theorem guarantees that L p (X,fi) is a Banach space. A par- 
tial converse to the L p dominated convergence theorem continues to hold, 
so that a sequence of functions converging in the L p norm admits a point- 
wise almost-everywhere convergent subsequence with a dominating function 
in L p , continues to hold. 

Observe, however, that the dominated convergence theorem fails to hold 
on L°°. The L p representation theorem for L p , which yields the identification 



= L p , also fails to hold for p = oo: see §^ 1.5.2 We shall have more to 



say about the representation theorem in the next subsection. 



1.1.3 cr-Finite Measure Spaces 

In this subsection, we review three major theorems of measure and inte- 
gration theory that requires the cr-finiteness hypothesis. The first is the LP 
representation theorem, as was alluded to above: 
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Theorem 1.2 (F. Riesz representation theorem, L p -space version). Suppose 
that (Jf, 97T, //) is a a -finite measure space. If p G [l,oo) ; then each bounded 
linear functional I on L p (X,fi) admits a unique linear function u E L p (X,fi) 
such that 

1(f) = J fudfi (1.2) 

for all f G L P (X, (i). Moreover, ||^||(lp)* = |M| L ,' ; whence (L p )* is isometri- 
cally isomorphic to LP . 

It is an easy consequence of Holder's inequality that every function of the 



form (1.2) is a bounded linear functional on L p . The representation theorem 



states that linear functionals of the form (1.2) are, in fact, all bounded linear 
functionals on L p . 

Since the proof of the representation theorem makes use of a few key 
notions that we shall need in later sections, we study it in detail. To this 
end, we fix a measurable space (X, OJt) and recall that a function v : 97t — > C 
is a complex measure if, for each E G 97T and every countable partition 
{E n : n G N} of E in 97t, the function v is countably additive, viz., 

oo 

!/(£)= 5>(tf B ). 
n=l 

Note that the definition forces \i[X) < oo. 

We sometimes use the name positive measures for measures proper in 
order to distinguish them from complex measures. In fact, there is a canonical 
way of assigning a positive measure corresponding to each complex measure 
v: the total variation of v is the positive measure \v\ defined to be 

oo 

M(£) = su P y>(£ n )i 

for each E G 971, where the supremum is taken over all partitions {E n : n G 
N} of E belonging to 97t. Recalling that a measure v, complex or positive, on 
(X, 97T) is said to be absolutely continuous with respect to a positive measure 
fi on (X, 971) if v(E) =0 for all E G 97t such that /i(E) = 0, we see that v is 
absolutely continuous with respect to \v\. In general, we write 



to denote the absolute continuity of v with respect to ji. 
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A polar opposite notion to absolute continuity is defined as follows: two 
measures v\ and z/ 2 , positive or complex, are said to be mutually singular if 
there exists a disjoint pair of measurable sets A and B such that 

u 1 (E)=v 1 (AnE) and v 2 (E) = u 2 (B n E) 

for all E em. We write 

v\ -L v 2 

to denote the mutual singularity of v\ and z/ 2 . 

We are now ready to state the second theorem of this section, which is 
the main ingredient of the proof of the representation theorem. 

Theorem 1.3 (Lebesgue-Radon-Nikodym). Let (X, 971) be a measurable 
space, v a complex measure, and fi a positive a-finite measure. Then there 
is a unique pair of complex measures u a and v s such that 

V = U a + V s , V a < 11, V s ± fi, 

and there exists a u G L 1 (X, /i) such that 

v a (E) = udfi 

J E 

for all E G 971. Any such function agrees with u almost everywhere on X . 

Two remarks are in order. First, if v is a positive finite measure, then so 
are v a and v s . Second, if v <^ /i, then du = udfi for an L 1 function u defined 
uniquely almost everywhere. This u is called the Radon- Nikodym derivative 
and is denoted by ^, so that 

dv = —dfi. 
dfx 

Having stated the Lebesgue-Radon-Nikodym theorem, we proceed to the 
proof of the L p representation theorem. In what follows, we use the complex 
signum function 

/if, if^GCx{0}; 
sgn z — < 1 1 

if z = 0. 
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Proof of Theorem We first claim that the norm of u G L p ' (X, /i) can be 
computed by the identity 



\u\y 



sup 

l/llo<l 



fu dfi 



;i.3) 



Note first that 



\fu\ dfi < \\f\\p\\u\\ p/ < \\u\\ p/ 



by Holder's inequality, so long as \\f\\ p < 1. If p > 1, then we set 



f(x) = \u(x)\ 



u 



and observe that 



fu dfi 



it 



— j- / \u(x) \ p = \\u\\ p >. 



Since ||/|| p = 1, the claim follows. If p = 1, then we fix e > and invoke the 
cr-finiteness of fi to find a set E of finite positive measure on which 



\u{x)\ > \\u\ 



e. 



We then set 



/(*) 



Xe{x) sgnu(x) 



and observe that ||/||i = 1 and 



/ f ' U =^E)S E H ^- M -~ £ - 



Since e > was arbitrary, the claim follows. 

We now establish a converse to Holder's inequality: namely, if u is a mea- 
surable function that is integrable on all sets of finite measure and satisfies 
the bound 



sup 
IMIp<i 
simple 



su 



k < oo, 
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then u G L p ' and \\u\\ p i = k. To this end, we recall that there exists a 
sequence (u n )^ =1 of simple functions such that \u n \ < \u\ almost everywhere 
and u n — > g pointwise almost everywhere. If p > 1, then we set 



f n (x) = \u n (x)\ p — ||p/ _ 1 
for each n G N and observe that 



\Un\\ p i 



k 



sup 

1 1 ^ 1 1 r> < 1 

simple 



su 



> 



f n U djJL 



f \u n (x)\ p ' djj, 



u.,. 



I p'-l 



\U>n ||p'- 



Fatou's lemma implies that \\u\\^, < k p \ and Holder's inequality establishes 
the reverse inequality, verifying the claim. If p = 1, then we fix e > and let 

E = {x : |u(a;)| > k + e}. 

Assume for a contradiction that fJ>(E) > 0, and invoke the a-finiteness of fi 
to find a set F of finite positive measure contained in E. We set 



/(*) 



XF(x)sgnu(x) 



and observe that 
k = 



sup 

|| s || i < 1 

s simple 



su 



> 



fu dp, 



j F \u\ dp 



p(F) 



>k + e, 



which is absurd. It thus follows that ||m||oo < k, and the reverse inequality is 
established by Holder's inequality. 

Let us now return to the proof of the theorem. Assume for now that p 
is a finite measure on X, so that xe G L p (X,p) for every measurable set E. 
Fix a bounded linear functional I on L p (X,p) and set 



v(E) = Kxe) 



for each measurable set E. We claim that v is a complex measure on (X, 971) 
that is absolutely continuous with respect to p. To see this, we first note 
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that the linearity of tp establishes the finite additivity of v. Given a pairwise 
disjoint sequence (E n )^ =1 of measurable sets, we set 



E = (J E n and F N = [j E n 



n=l n =N+l 

for each iVGN. Observe that xe = (XEi + ' ' ' + Xe n ) + Xf n , and so 



Since 



HF)\ = \1( X f)\ < \\1\\(l«)*\\Xf\\ p = (M^)) 1/P , (1-4) 

for every measurable set F, we see that u(F N ) — > as N — > oo. Therefore, 



v is countably additive, and (1.4) shows that v <C /i. 



We now invoke the Lebesgue-Radon-Nikodym theorem to find the unique 
it G L 1 (X, /i) such that 

f(-B) = udfj, 
Je 

for all measurable sets i?. Therefore, 



XEud/ji 



and the linearity of the integral implies that 



l(s) = J sudfi 



for each simple function sonl. Recalling that every LP function can be 
approximated by simple functions, we conclude that 

<f{f) = / f ud V 



for all / G L p (X,fi). Furthermore, we have 



\u\\pi = sup 

ll/L<i 



fu dfj, 



SUP = ||/||(LP)* 

ll/llp<l 



by formula (1.3). This establishes the theorem for fi(X) < oo. 



16 



CHAPTER 1. THE CLASSICAL THEORY OF INTERPOLATION 



We now lift the assumption that [i is finite. By the cx-finiteness of /i, we 
can find an increasing sequence (E n )^ =1 of finite-measure sets whose union 
is X. On each E n , we invoke the representation theorem for finite measures 
to find an integrable function u n on E n such that 



KfXEj = / fUndfl 

for all / G L P (X, //). We extend u n onto X by setting it to be zero onX\i? n 
and invoke the converse of Holder's inequality to see that 

\\ u n\\q < Hi || (LP)*- 

Note that (m„)~ =1 is a pointwise almost-everywhere convergent sequence 
of integrable functions. We set the limit to be u and apply Fatou's lemma 
to conclude that 

\\u\\ q < ||/||(Lf)*- 

It now follows that 

KfXE n ) = J fXE n udfx 
for each / G L p (X,/i) and every n G N, whence taking the limit yields 

/(/) = J fudlJL. 

We now apply Holder's inequality to establish the reverse inequality 

\\u\U > \\l\\w, 

and the proof is complete. □ 

Finally, we review integration on product spaces. Given two measure 
spaces (X, 971, /i) and (Y, 9T, v), we define the product a -algebra 9JT ® 9T to be 
the smallest cr-algebra containing the collection 

{E x F : E G Ti and F G 01}. 

of measurable rectangles. It is a standard fact that the set function 

((ixv)(ExF)=[i(E)v(F), 
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initially denned on the collection of measurable rectangles, can be extended to 
a measure on (X x Y, 9Jl<g>01), forming a measure space (X x Y, 971® 01, /x x v). 

HECXxY,xEX, and y G Y, we define the x-section E x and the 
y-section E y of i£ as follows: 

= {(,'gF: (x, i/') G and E y = {x' G X : (V, y) G £}. 

Analogously, given a function / : X x K — > C, we define the x-section f x 
and the y-section f y of / as follows: 

fM = F(x) = f(x,y). 

The main theorem, due to Guido Fubini and Leonida Tonelli, gives sufficient 
conditions for which the order of integration may be exchanged: 

Theorem 1.4 (Fubini- Tonelli). Let (X, 9JI, fi) and (Y, 01, v) are a -finite mea- 
sure spaces. 

(a) Tonelli's theorem. If f is a nonnegative integrable function on X x 
Y , then the functions x t- >■ J f x dv and 1/ 4 J f y dfi are nonnegative 
integrable functions on X and Y , respectively, and 



fd(fixi/)= / / f(x,y)du(y)dfi(x) = / / f(x,y)dfi(x)du(y). 

XxY JXJY JYJX 

(b) Fubini's theorem. If f is integrable on X xY, then f x is integrable on 
Y for almost every x G X, f y is integrable on X for almost every y G Y , 
the function x J f x dv is integrable on X, the function y \-> f f y dfi is 
integrable on Y , and 

\ fd(fj,xv)= / / f(x,y)dv(y)dfi(x) = / / f(x,y)dfj,(x)du(y). 

J XxY J X JY JY J X 



1.1.4 The Lebesgue Measure 

We conclude our review by presenting a rapid treatment of the basic proper- 
ties of the canonical measure on the Euclidean space, the Lebesgue measure. 
We adopt a particularly constructive approach from [SS05], hinging on a 
decomposition theorem of Hassler Whitney. In what follows, a cube is an 
n-fold product of closed intervals of the same length, and two cubes in W 1 
are almost disjoint if their interiors are disjoint. 
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Theorem 1.5 (Whitney decomposition theorem). Every open set in IR d can 

be decomposed into a union of countably many almost- disjoint cubes. 

Our version of the theorem omits the estimate on the sizes of the cubes. 
See § §1.5.3 for the precise version. We shall have more occasions to use the 



decomposition theorem, so we present a full proof of the theorem. 




Figure 1.1: A Whitney decomposition of a two-dimensional figure 



Proof. Let O be an open subset of M, d . For each n, we consider the grid 
formed by cubes of side length 2~ n , whose vertices have coordinates in 

2- n Z d = {(k u ...,k d ): Tkj G Z for all 1 < j < d}. 

Note that the grid formed at the nth stage is obtained by bisecting the cubes 
that formed the grid at the {n — l)th stage. We define C n to be the collection 
of all such cubes, of side length 2~ n , that intersect O. Note that [JC n is 
a countable collection of cubes, and that the union of all cubes in each C n 
contains O. 

We now extract a collection C of almost-disjoint cubes from |J C n as fol- 
lows. We begin by declaring every cube in C\ to be a member of C. For each 
n, we throw away all cubes in C that intersect nontrivially with some cubes 
in C n and add in all cubes in C n that are almost disjoint from every remaining 
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cube in C. The resulting collection C clearly consists of almost-disjoint cubes. 
Since C C [jC n , the collection is countable as well. 

It now remains to show that the union of all cubes in C is O. Since the 
union evidently contains O, it suffices to show that no point in R d \ O is 
covered by C. Let x be such a point, and suppose for a contradiction that 
there is a cube Q G C containing x. This, in particular, implies that Q % O. 
Fix y E Q (10. Since O is open, a sufficiently large integer N guarantees 
that the grid at the iVth stage admits a cube Qf of side length 2~ N that 
contains y and is contained entirely in O. But Qf is a smaller cube than Q 
that intersects Q nontrivially, whence Q cannot be an element of C in the 
first place. This is absurd, and the proof is now complete. □ 

The decomposition provides a natural way of assigning a volume to each 
open set in IR n : we look at the sum of the volumes of the cubes in each 
Whitney decomposition and take the infimum as the volume of the open 
set. We then define the Lebesgue outer measure m* of an arbitrary subset to 
be the infimum of the volumes of the open supersets of the set. To ensure 
countable additivity, we restrict the outer measure to the subsets E of W 1 
such that each e > admits an open superset O of E with the estimate 
m*(0 \ E) < e. Such a set is called a Lebesgue-measurable set, and the 
restriction of m* onto the collection of Lebesgue-measurable sets is referred 
to as the Lebesgue measure. We denote the Lebesgue measure of E by m(E), 
or \E\ if there is no danger of confusion. 

Before we review the basic properties of the Lebesgue measure, we remark 
that any reference to measurability of subsets of M, d or functions on IR d in 
this thesis shall be for the Lebesgue measure, unless otherwise specified. We 
now recall that the Lebesgue measure of an arbitrary measurable set can be 
approximated by that of open sets and closed sets: 

Proposition 1.6. If E is a measurable subset ofW 1 , then each e > has a 
corresponding closed set F C E and an open set O ^ E such that \E\F\ < e 
and \0 \ E\ < e. If \E\ < oo, we may take F to be a compact set. 

It follows from the above proposition and the continuity of measure that 
the Lebesgue measure is Borel regular: m is a Borel measure, and each 
measurable subset E of M, d has a corresponding Borel subset B of M d such that 
E C B and \E\ = \B\. Even better, it turns out that countable intersections 
of open sets, known as G$ sets, and countable unions of closed sets, known 
as F a sets, are quite enough: 
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Proposition 1.7. E C R d is measurable 

(a) if and only if there exists a G$ set GCK' i such that \G \ E\ — 0; 

(b) if and only if there exists an F a set F C R d such that \E \ F| = 0. 

The Lebesgue measure behaves well under linear endomorphisms on R d : 

Theorem 1.8. If E C R d is measurable and T : R d — >■ lR d a /mear transfor- 
mation, then 



This, in particular, implies that the Lebesgue measure is invariant under 
translation and rotation, and scales in tune with the usual geometric intuition 
under dilation. 

We frequently denote the integral of a measurable function / : R d — > C 
with respect to the Lebesgue measure by 



We also recall that the change-of-variables formula continues to hold for 
the Lebesgue integral: 

Theorem 1.9 (Change-of-variables formula). If O is an open subset ofR d 
and (ft : O M. d an injective differentiate function, then, for each f e L l {0), 
we have / o^g L 1 ((f)(0)) and 



m(T(E)) = \detT\m(E). 




instead of the more cumbersome 





where D(p(x) is the total derivative of (p at x. 
This, in particular, implies that 
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for all h e R d and 5 > 0. 

With this, we conclude the review. We refer the reader to § £ 1.5.4 for a 
discussion of some other nice properties of the Lebesgue measure. 
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1.2 Approximation in L p Spaces 



The central objects of study in interpolation theory are function spaces and 
linear operators between function spaces. Typically, the function spaces are 
vector spaces equipped with topologies that are compatible with the vector- 
space structure. We can then require the operators to be continuous, so as to 
have them behave well under various limiting processes. Recall that a linear 
operator T : V — > W between normed linear spaces V and W is bounded if 
there exists a constant k > such that 



for all v G V. It is easy to show that T is bounded if and only if T is continu- 
ous with respect to the norm topologies of V and W, and that the collection 
j£f (V, W) of bounded linear operators from V to W with the operator norm 



is a Banach space if W is a Banach space. 

It is, however, cumbersome to specify the value of an operator at all points 
on its domain. We therefore seek to find a suitable subset of the domain that 
is essentially the whole space. 

Definition 1.10. A subset D of a topological space X is dense if the closure 
of D is in X is X. 

As it turns out, it is enough in many cases to specify the value of an 
operator on a dense subset of its domain. Even better, the extension is 
norm-preserving. 

Theorem 1.11. Let V and W be normed linear spaces and D a dense linear 
subspace of V . If W is a Banach space and T : D — >■ W is a bounded linear 
operator, then there exists a unique linear operator 7\ : V — >■ W such that 
||Ti||v-».w = H^IId-^ and Ti\ D = T. 

Proof. For each v G V, we find a sequence (v n )^ =1 in D that converges to v. 
Since T is bounded, (Tv n )^ =1 is a Cauchy sequence in W , whence it converges 
to a vector w v G W. If (^)^Li is another sequence in D that converges to 
v, then 



\\Tv\\ w < k\\v\\ v 



\\T\\v^w 



sup ||Tv|| w 

IMIw<i 




'n 



~ W. 



v 
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and so (Tv' n )^ =1 converges to w v as well. The operator 

T lV : 



Tv if v e D 
w v if v e V \ D 



is therefore well-defined, and its linearity is a trivial consequence of the lin- 
earity of T. Furthermore, 



||Ti^|| = lim ||Tii> n || = lim ||Tv n || < lim ||T||||t> n || = ||T||||t> 

n— >oo n— >oo n— !>oo 

for each v G V, so that 1 1 T\ 1 1 < ||T||. Since 

||Ti|| = sup ||Tif || > sup \\Tiv\\ = sup \\Tv\\ = \\T\\, 

\\v\\<l \\v\\<l \\v\\<l 

veD veD 

we have 1 1 T\ 1 1 = ||T||, as was to be shown. □ 

1.2.1 Approximation by Continuous Functions 

It is therefore useful to have several examples of dense subspaces of frequently 
used function spaces. We know, for example, that we can approximate in- 
tegrable functions by simple functions, whence the space of simple functions 
is dense in L 1 . Moreover, we can approximate just as well if after restricting 
ourselves to simple functions on sets of finite measure. 

Proposition 1.12. Let (X,9Jl,/i) be a a-finite measure space. The space of 
simple functions with finite-measure support is dense in L 1 (X, yu). 

Proof. Let (X n )^ =1 be an increasing sequence of finite-measure subsets of 
X whose union is X. Given an / e L 1 (X, /i), the monotone convergence 
theorem implies that 

lim ||/-/Xn||=0. 
Therefore, each e > admits an integer N such that 

/ \f ~ fXx N \dfx < e. 
Jx 

We now find a sequence (s n )^ =1 of simple functions in L 1 (X N ,fi) such that 
\s n \ ^ \f'Xx N \ almost everywhere and s n — > f pointwise almost everywhere. 
Arguing as above, we can find an integer M such that 



J Xjv 



\sm - fXx N \d/2 < e. 
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Extending % onto X by denning %(x) = on 1 \ X N , we see that 

11/ - s M ||i < 11/ - fXx N \\i + WfXXtf ~ smIIi < 2e, 

as desired. □ 

While the above proposition is useful, the domain of the simple functions 
in question can be quite complicated. In order to obtain more refined ap- 
proximations, it is necessary to confine ourselves to nicer measure spaces. 
For simplicity, we shall work on IR d , but the main theorem of this subsec- 



tion (Theorem 1.14) can be established on more general measure spaces. See 



§£ 1.5.5| for a discussion. 

First, we observe that it suffices to deal with simple functions on very 
nice domains. 

Theorem 1.13. The space of simple functions over cubes is dense in L 1 (R o! ). 



Proof. In light of Proposition 1.12, it suffices to approximate characteristic 



functions over finite-measure sets by simple functions over cubes. We there- 



fore fix a set E of finite measure. Pick e > 0, and invoke Proposition L6 to 
find an open set O containing E such that \0 \ E\ < e. We then have 

\\xo — Xe\\i = IxckeIIi < £■ 



Let (Q n )^Li be a Whitney decomposition of O. Since the intersection of 
two almost-disjoint cubes is of measure zero, we have 

oo 

\0\ = Y,\QnV 

71=1 

Noting that \0\ < \E\ + e, we can find an integer N such that 

oo 

E \Qn\ 
n=N+l 

This, in particular, implies that 



< e. 



N 



N 



[jQr 



^2\Qn\ < \0\ -6, 



n=l 



71=1 
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and so 



Therefore, 



A? 



\Xo 



- XQiU-UQjvlll — \Q\ - y] \Qn\ < £■ 



n=l 



\\XE - XQiU-UQ N \\l < \\XE - Xo\\l + \\XO - XQiU-UQjv||l < g £ - 

It remains to "disjointify" the cubes Q\, . . . , Qn, so as to turn XQiU---uq n 
into a finite sum of characteristic functions over cubes. For each 1 < n < N, 
we fix a cube R n in the interior of Q n such that \Q n \ R n \ < e. Then 
{Ri, . . . , Rn} is a pairwise-disjoint collection of cubes, and 



XQiU-uq? 



N 

^XRn 



n=l 



— IIXQiU-uQjv — XRiu-ur n ||i 

= ||X(Qi\.Ri)U-"U(Qiv\-Riv)|| 1 
N 

= ^ \Qn \ -Rn| 
n=l 



It now follows that 



A? 



Xe ^ ^ XRn 

n=l 



< ||X-E - XQiU-UQjvlll + 



N 



XQiU-UQ N — XRn 



n=l 



as was to be shown. 



□ 



We note that a characteristic function over a cube can be approximated 
quite easily with a continuous function: we just draw steep lines from the 
boundary of the graph down to zero, thereby producing a function with 
a tent-like graph. Precisely, we construct a tent function, which is 1 on 
a nice set — a cube in our case — and outside of a small dilation of the 
set. Once we approximate a characteristic function over an arbitrary cube 
with a continuous function, we can then appeal to the density of simple 
functions over cubes to show that integrable functions can be approximated 
with continuous functions. This is the content of the following theorem: 
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Theorem 1.14. The space C c (IR d ) of continuous functions on Mr with com- 
pact support is dense in L p {R d ) for each 1 < p < oo. 



Proof. We first prove the theorem for p — 1. By Theorem |1.13[ it suffices 
to approximate characteristic functions over cubes by continuous functions 
with compact support. This is done by constructing a tent function over the 
generic cube Q — [a 1; bi] x • • • x [a d , b d ]. To do so, we fix 5 > 0, and let 



fs(x) 



X 



if \x\ < 1; 

if 1 < \x\ < 1 + 5; 

if \x\ > 1 + 5; 



This is a tent function over [—1, 1], with decay taking place on intervals of 
length 5 to make the function continuous. We observe that 

Wfs - XhuilUw < \\X[-x+s,i+s\ - XI-i.ijIUhk) = 2( ^' 
whence f$ is a continuous approximation of the characteristic function X[-i,il- 



1 hB- 




/ 0.8 




/ 0.6 




/ 0.4 




/ 0.2 





-1.5 



-1.0 



-0.5 



0.5 



1.0 



1.5 



Figure 1.2: A one-dimensional tent function, with 5 = 0.3 



For each 1 < n < d, we consider the function 



9 n s{x) = f 



'n 



X + 



b n -\- CL n 
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This is a tent function over [a n , b n ], viz., g$ is a continuous function that 
is 1 on [a n , b n ] and vanishes outside an interval slightly bigger than [a n ,i> n ]. 
Precisely, the decay to zero takes place on the intervals [a n — (b n — a n )5/2, a n ] 
and [b n , b n + (b n — a n )8/2], so that a similar computation as above yields 

\\9s ~ X[a n ,b n ] IUhk) < ( & « ~ a ™) 5 - ( 1 - 5 ) 

We now set 

g 5 (x u ...,x d )= g}(xx) ■ ■ ■ gj{x d ). 

By construction, g$ is clearly 1 on Q and outside a cube slightly bigger 



than Q. By Tonelli's theorem and the (1.5), we have 



\9s ~ XqIIlW < ^ri( &r ' 



n=l 



Since 5 can be made arbitrarily small, we have successfully produced a con- 
tinuous approximation of xq- This proves the theorem for p = 1. 

We move onto the p > 1 case. We fix e > and claim that each / G 
D>(R d ) furnishes age L°°(R d ) and a compact set K C (IR d ) such that 
suppg C and ||/ — < e. To see this, we define the truncation operator 
T r : C — > C at r > by setting 



if \z\ < r; 
if |z| > r; 



and set 

fn = XB n (0)T n f 

for each n G N. The dominated convergence theorem implies that ||/ n — 
/||p — )■ as n — > oo, and so we can pick an integer N such that ||/jv — f\\ p < 
e/2. 

Fix a second constant e\ > 0. Since /„ G L x (]R d ), we can find g' G ^(IR^) 
such that \\g' — /jv||i < £i- We set (7 = Tyi^g' and note that 

5fG^ c (IR d ), Halloo < H/jvlloo, and ||</ - < ei. 
It now follows from Holder's inequality that 

\\f-g\\ P < 11/ - JnWp + \\Jn - g\\ P 

< ^+\\9-9i\\l /P \\9-9i\\ 1 J l/p) 

< l + sl /p (2\\g\U-^\ 
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whence picking a sufficiently small E\ > yields 

\\f -g\\ P < e- 
This completes the proof of the theorem. 



□ 



1.2.2 Convolutions 

To refine our approximation techniques even further, we now introduce a 
widely used "smoothing" operation. 

Definition 1.15. The convolution of measurable functions / and g on R d 
at x G M. d is defined to be 



whenever the expression is well-defined. 

Convolutions can be thought of as a kind of weighted average. Indeed, 
if f(x) = 1, then f * g corresponds to the integral mean value of g over the 
entire space. Before we discuss why convolutions are smoothing operations, 
we establish a few basic properties thereof. 

Theorem 1.16 (Properties of convolutions). Let 1 < p < oo. 

(a) The convolution of two measurable functions is measurable. 

(b) If f and g are measurable, then f * g = g * f . 

(c) Young's inequality. If f e L p (R d ) and g E L 1 (R d ), then f * g is 
well-defined almost everywhere and 



The following inequality of Hermann Minkowski, which we shall use fre- 
quently in the remainder of the thesis, plays a crucial role in the proof of 



Theorem 1.17 (Minkowski's integral inequality). Let (X,Wl,fj) and (Y,yi, 
u) be a -finite measure spaces and f an (fix v) -measurable function on X xY . 
If f > and 1 < p < oo, then 




ll/*<?l|p<ll/Ublli- 



(c). 
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Proof. T he p = 1 case is Tonelli's theorem If 1 < p < oo, then Tonelli's 
theorem and Holder's inequality imply that each g 6 L p (X, //) satisfies the 



following inequality: 



dv{y) 



< 



i/p 



f(x,y) p d/j,(x)\ \\g\\ p ,dv{y) 



i/p 

f(x,y) p dfi(x) ) du(y) 



Let 



(p(x) 




f(x,y) dv{y). 



We know from the Riesz representation theorem that 



p = sup 

llffll P '<i 



1 G?/Z 



whence the above inequality implies that 



v \ 1 /p 

f(x,y)dv(y) ) dfi(x) 



sup 

NI P '<i 



< sup \\g\\ p/ 
llffll„'<i 



1 d\i 



f(x,y) p d/j(x) du(y) 



i/p 



f(x,y) p dfi(x)j dv(y), 

as was to be shown. 

We proceed to the proof of the basic properties of convolutions. 



□ 



Proof of Theorem 1.16] (a) Let / and g be measurable functions on M. d . We 
first show that 

h(x,y) = f(x-y) 
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is measurable on M d x M d . It clearly suffices to prove that f± (B r (z)) is 
measurable for each r > and every z G C. For each subset E of R d , we 
define 

E = {(x, y) G R d x R d : x - y G E}. 
Since the subtraction operation 

(x,y)t-+ x-y 

is a continuous map from R d x R d into M d , the set E is open whenever i£ is 
open. By taking a countable intersection, we see that E is a set if i£ is. 

We also claim that E is of measure zero whenever E is of measure zero. 
Indeed, if \E\ = 0, then we can find a sequence {O n )^ =l of open sets such 



that O n ~D E for each n and \O n \ — > as n — > 00. Given n, k G N, Tonelli's 



theorem| and the translation invariance of the Lebesgue measure imply that 
O n nB k (0)\ = 1 1 Xo n ( x ~ v)XB k (o)(y) dy dx 

Xo,A x -y)dx) XB k (o) dy 



Xo n ( x )dxj XB k (o) dy 
= \O n \\B k \. 

Therefore, if we set E k = E PI Bk(0) for each positive integer k, then Ej. C 
O n fli? fc (0) for all n and |O n nS fc (0)| — > as n — > 00. It follows that \E k \ = 0, 
whence by continuity of measure we have \E\ = 0, as desired. 

We now fix an r > and a z G C and set E = f~ l (B r (z)), so that 
E = f{\B r (z)). Since B r (z) is open, the measurability of / implies the 



measurability of E, whence Proposition 1.7 furnishes a G$ set G such that 



G D E and \G \ S| = 0. Setting F = G\E,we see that 

G = £UF 

is a set. Since |F| = 0, the above arg ument shows that \F\ = 0, whereby 
we appeal once again to Proposition 



1.7 



to conclude that £/ is measurable. 
It follows that if / and g are measurable functions on IR d , then 



(z,y) ^ f(x-y)g(y) 
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is measurable on 



. We now invoke Fubini's theorem to conclude that 



f{x - y)g(y) dy 



is measurable on M d . 

(b) This is a trivial consequence of the commutativity of multiplication 
in C and the translation invariance of the Lebesgue measure. 



(c) Since \ (f*g)(x)\ < f \f(x—y)\\g(y) \ dy, we invoke Minkowski's integral 



inequality to conclude that 



\\f*g\ 



f(x - y)g{y) dy 



dx 



i/p 



< 



i/p 

\f(x-y)\ p dx) \g(y)\dy 



\9\\u 



as was to be shown. 



□ 



We now return to the task of justifying the "smoothing operator" nick- 
name that convolutions possess. We begin by showing that the convolution 
of two compactly supported functions is compactly supported. 



Theorem 1.18. Let 1 < p < oo. Iff £ L p ( 



and g £ L 1 (R d ), then 



supp(g * h) C supp / + supp g = {x + y : x £ supp / and y £ supp g} 

As it stands now, however, it is not entirely clear how we should interpret 
the statement of the above theorem. While two functions that are almost 
everywhere are considered to be "the same" in integration theory, the tradi- 
tional notion of support can fail to assign the same support to both functions. 
To rectify this issue, we adopt a new definition: 

Definition 1.19. Let / be a complex- valued function on IR d and O the union 
of all open sets in M. d on which / vanishes almost everywhere. We define the 
support of /, denoted supp /, to be the complement of O. 

Of course, we must justify the new terminology: 

Proposition 1.20. Let f and O be defined as above. Then f vanishes al- 
most everywhere on O. If g is another function that is equal to f almost 
everywhere, then supp / = supp g. Furthermore, this definition of support 
agrees with the old definition of support for continuous functions. 
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Proof. Since M. d is second-countable, we can find a sequence (O n )^ =1 of of 
open sets such that / vanishes almost everywhere on each O n and that the 
union of all O n is O. The union is countable, and so / vanishes almost 
everywhere on O. If / = g almost everywhere, then / vanishes almost 
everywhere on a vanishing set of g, and vice versa, whence supp / and supp g 
must agree. If / is continuous, then O = / _1 (C s {0}), and so supp / = 
/ _1 ({0}), as was to be shown. □ 

With the new definition, we proceed to the proof of the theorem. 



Proof of Theorem 1.18 . By Young's inequality, the map y i-> f(x — y)g(y) is 
integrable for each x G W 1 . Writing x — supp / to denote the set {x — y : y G 
supp/}, we see that 

{f*g){x)= f{x -y)g{y)dy. 

J(x-supp/)nsupp3 

Let E = supp /+supp g for notational simplicity. We note that x E implies 
(x — supp/) fl supp g = 0, so that (/ * g)(x) = 0. Therefore, (/ * g)(x) = 
for almost every x G lR d \ E. In particular, (/ * g){x) = for almost every 
x in the interior of M d \ E, and so 

supp(/ * g) C E 

by the new definition of support. □ 

We are now ready to supply the promised justification of the smoothing- 
operations nickname. For notational simplicity we define the following short- 
hand: 

Definition 1.21. A (i-dimensional multi-index is a d-tuple 

a = («i, ...,a d ) 

consisting of nonnegative integers. We employ the following notations for 
multi-indices; here a and /3 are multi-indices, and x an element of M. d : 



a\ = «i + • • • + «rf 

d \ ai ( d ^ ad 



r a — <r a l . . . ™ Q d 
A — X 1 dj d 



dxi J \dxd, 
a±p = (ai ± p x , ■ ■ ■ , a d ± p d ). 
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The main theorem can now be stated as follows: 

Theorem 1.22 (Convolution as a smoothing operation). Convolutions are 
"smoothing operations" in the following sense: 

(a) If f G ^ c (R d ) and g G L} oc (M. d ), then f*g is well-defined everywhere and 
f*getf(R d ). 

(b) If f e tf c k (R d ) and g G Lj, c (R d ), £/ien / * g G %f fc (R d ) and 

D a (f*g) = (D a f)*g 
for each multi-index \a\ < k. The result holds for k = oo as well. 

(c) If f G tf k (R d ) and g G tf l (R d ), then f*ge ^ k+l (R d ) and 

D a ^(f*g) = (D a f)*(D^g) 
for all multi-index \a\ < k and \(3\ < I. 

Proof, (a) For each x G R d , the map x h-> f(x — y)g(y) is measurable and 
has compact support, hence integrable. Therefore, (/ * g)(x) is defined for 
all x G R d . We now fix x G lR n , pick a sequence (x„)~ =1 in IR d converging to 
x, and find a compact subset K of lR d such that 

x n - supp / = {x n -y :y G supp f}CK 

for each n G N. It then follows that / is uniformly continuous on K and 
f(x n — y) = for all n G N and y G lR d \ X. We can thus pick a sequence 
(e n )™ =1 of positive real numbers converging to zero such that 

\f(x n -V) - f(x -y)\ < E n X K (y) 

for each n G N and every y G R d . Multiplying through by |<?(y)| and inte- 
grating with respect to y, we obtain 

\(f * g)(x n ) - (f * g)(x)\ <e n [ \g(y)\dy. 

Since the right-hand side converges to zero as n — > oo, we conclude that 

lim (f*g)(x n ) = (f*g)(x), 
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as was to be shown. 

(b) We suppose for now that k — 1. The task at hand then reduces to 
establishing the claim that / * g is continuously differentiable at each igR 1 * 
and 

V(f*g)(x) = (Vf*g)(x). 
To this end, we pick x G M. d . For each y G M. d , we observe that 

Um - y) + h) - fjx -y)- Vfjx -y)-h\ = Q 

\h\->0 \h\ 

whence every e > admits M £ > such that 

\f((x -y) + h)- f(x -y)- Vf{x - y) ■ h\ < e\h\ 

for all \h\ < M s . 

Fix a compact subset K of lR d such that 

x - supp/ + S Me (0) = {{x - y) + h : y G supp / and < M e } C K. 
Since 

/((x - y) + h) - f{x -y)- V/(x - y) • /i = 
for all \h\ < M £ and y <E K, we have 

- y) + h) - f{x -y)- Vf\x -y)-h\< e\h\ XK iy) 

for all y G R d . Multiplying through by |<?(y)| and integrating with respect to 
y, we see that 

\(f*g)(x + h)-(f*g)(x)-(Vf*g)(x).h\<e\h\ [ \g(y)\dy. 

J K 

It follows that / * g is differentiable at x, with the gradient 

V(f*g)(x) = (Vf*g)(x). 

f G ^(M. d ) implies that V/ G ^ c (M d ), whence Vf * g E ^(M. d ) by (a). 
This completes the proof for k — 1. The case for > 1 now follows from 
induction. 

(c) is a trivial consequence of (b) and the commutativity of convolution, 
and the proof is now complete. □ 
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1.2.3 Approximation by Smooth Functions 

We shall now establish the final approximation theorem of this section: 
namely, the approximation of L p functions by smooth functions. As was 
hinted at in the previous subsection, we shall use convolutions to smooth out 
the approximating functions. The key result, known as approximations to 
the identity provides a widely applicable tool for generating a collection of 
approximating functions for any given LP function. 

Theorem 1.23 (Approximations to the identity). Let 1 < p < oo. If f £ 

L p (R d ) and p e L 1 (R d ) such that Jp=l, then \\f * p £ - f\\ p ->■ as e -> 0, 
where p e (x) = e~ d p{e~ 1 x) for each e > 0. 



As per Theorem 1.22 , we can make the approximating functions as well- 
behaved as we would like. Indeed, we can construct smooth approximations 
to the identity, which we shall furnish after the proof of the theorem. 

Proof. We set A f (y) = \\f(x — y) — /(x)|| p for each y 6 R d . Fix 5 > and 
invoke Theorem 



1.14 



to find f x e tf c (R d ) with H/-/! ||p < 5. Set f 2 = f - }\. 
Since fi(x—y) converges uniformly to fi(x) as y — > 0, we see that A/j(j/) — > 
as y — > 0. Moreover, Af 2 (y) < 25, whence 

as y — > 0. By Minkowski's integral inequality! we have the following estimate: 



ll/*Pe-/||p 



< 



f(x-y)p e {y)dy - f{x) 

f{%-y)pe{y)dy- f(x) I p £ {y)dy 

[f(x-y) - f(x)}p £ (y) dy 

\\f(x -y)- f(x)\\ LP{x) \p s (y)\dy 
A f (y)\p E (y)\dy 

A f(£y)\p(y)\dy; 



See §^1.5.6 for a discussion on the name "approximations to the identity" 
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the last inequality follows from the |change-of-variables formula 

We have shown above that Af(ey) — > as e — > 0. Furthermore, we have 
the bound 

\A f (Sy)p(y)\ < ||2/||>(y)|, 
whence by the dominated convergence theorem we obtain 

\im\\f * p £ - f\\ p < lim J A f (ey)\p(y)\dy 
= J kmA f (ey)\p(y)\dy, 

as was to be shown. □ 

Corollary 1.24 (Smooth approximations to the identity). There exists a 
sequence of mollifiers on IR d , which is a sequence (p n )'^' =l of nonnegative 
ff^-maps on M. d such that supp p n C B 1 / n (0) and J p n = 1 for each nGN. 
Furthermore, if f G L p {R d ), then \\f * p n — f\\ p — > as n — )■ oo. 



3.0 




25- 




I 2.0 














' 1.0 






, , , / Jl, 


0.5 




\ V i V i , , i^-x 



-1.0 -0.5 0.5 1.0 

Figure 1.3: The first four mollifiers 
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Proof. We set 

4>[x) 

and 



e V(M 2 -i) if y < 



if W > 1 



(f) £ (x) 



e d (b(e l x] 



J 4>{x) dx 

for each e > Then each <p £ is a compactly supported smooth function whose 
integral is 1, whence by Theorem 1.23 we have 

lim||/*p e -/|| p = 0. 

We now define a sequence (p n )^ =1 of functions by setting p n = <f>y n for 
each n£N. It immediately follows from the above construction that this is 
a sequence of mollifiers. □ 

The approximation theorem now follows as a simple corollary. 

Corollary 1.25. ^ c °°(lR a! ) is dense in L p (R d ) for each 1 < p < oo. 

Proof. Fix 1 < p < oo. Let (p n )^L 1 be a sequence of mollifiers, set B n = 
B n (0) for each n G N, and define a sequence (/ n )^=i by 

/n = (fXB n ) * Pn- 



Then Young's inequality implies that 

\\f-fn\\p < \\f-f*Pn\\p+\\Pn*f-Pn*(fXB n 

= \\f ~ f * Pn\\p+\\Pn*(f ~ fXB n )\\ P 

< ||/-/*Pn||p+||/-/Xi^U|Pn||l 

= ||/-/*Pn||p+||/-/XB B ||p. 



By Corollary 1.24, we have ||/— (f*p n )\\p as n -> oo, and the dominated 
convergence theorem implies that ||/ — /xs n || p — >■ as n — > oo. It follows 
that 

lim ||/ - f n \\p = 0, 

n— >oo 

as was to be shown. □ 

We conclude the section with another instant of convolutions as smooth- 
ing operations. This time, we are able to recover continuity without any 
smoothness on either side. 
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Corollary 1.26. Letl<p<oo. If f E L p (R d ) and g E L p '(M. d ), then f*g 
belongs to the space ^o(^ d ) of continuous functions vanishing at infinity. 

Proof. By Holder's inequality, / * g is well-defined everywhere on Mr. For 
each e > 0, Corollary 1.25 furnishes f e ,g e E ^ c °°(IR d ) such that 

\\f~ fe\\p < ,„ ,,| 1 || || n and \\g-9e\\p> < „,■■,„ £ .n || v 

2(||/||p+ \\g\\ P >) 2(||/|| p + IMlpO 

It then follows from Holder's inequality that 

\\f * 9 ~ fe* 9e\\oo < ||(/-/ e )*0||oo+||/ e *(0-&)||oo 

< 1111/ -AIIpIIs'IIp'IIoo + IIIIAIIplk-^llp'lL 

< 11/ - /ellplMlp' + H/elUIS - 9e\\p' 

< (II/L + \\g\\p')(\\f - fe\\p + \\g-9eh 

= e, 

whence / * g is a uniform limit of smooth functions / e * <7 £ with compact 
support. This establishes the corollary. □ 
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1.3 The Fourier Transform 

We now restrict our attention to the famous operator of Joseph Fourier, the 
Fourier transform. To motivate the definition, we consider the "limiting case" 
of the classical Fourier series 

oo 

f(n)e 2mnx/L , 

n=— oo 

of L-periodic functions / : [— L/2, L/2] — > E, whose Fourier coefficients are 
given by the formula 

1 r L/2 

f(n) = - f{x)e-* rin *' L dx 

J — Z//2 

Indeed, we make a simple change of variable in the above formula to obtain 

f(n)= f 1/2 f(Lx)e- 2 ™* dx, 

J -1/2 

and "sending L to infinity" leads us to the following: 

/oo 
f(x)e- 2 ™*dx. 
-oo 

So long as / decays suitably at infinity, the integral makes sense even when 
n is not an integer. Therefore, we replace n with a real variable £ : 

/oo 
f(x)e-™*°dx. 
-oo 

We promptly generalize the above "transform" to higher dimensions; this, 
of course, requires us to take the scalar product of multi-dimensional variables 
x and £, which we do by taking the standard dot product: 

Hi) [ f(x)e- 2 **-*dx. 
1.3.1 The L 1 Theory 

The above expression makes sense only if f(x)e~ 2nt ^ x is in L 1 for all £. Since 
| e — | _ y for all x and £, this is equivalent to the condition that / is in 
L 1 (M d ). We are thus led to the following definition: 
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Definition 1.27. The Fourier transform of / G L 1 (IR d ) is the function / 
given by 

f(0= [ f(x)e- 2m ^dx 



for each £ G M d . We also write & f to denote the Fourier transform of /. 

Note that & can be thought of as an operator on L 1 (M d ). By the linearity 
of the integral, & is a linear operator. The target space of J^", as well as a 
few other basic properties of J^", are established in the following proposition. 

Proposition 1.28. The Fourier transform of f G L 1 (IR d ) satisfies the fol- 
lowing properties: 

( a ) ll/lloo — \\f\\i- Therefore, & is a bounded linear operator from L 1 (IR a! ) 
into L°°(R d ). 

(b) f is uniformly continuous on W 1 . 

(c) Riemann-Lebesgue lemma. / vanishes at infinity, viz., /(£) — > as 
|£|-Xx>. 

Proof, (a) It suffices to observe that 

/ f(x)e~ 2 ^ x dx < f \f{x)\\e- 2 ^ x \dx < 

(b) Since \f(x + h)-f(x)\ <2\f(x)\ for all sufficiently small h G R d , it 
follows from the dominated convergence theorem that 

lim \fc + h)-fc)\ < lim [\f( x + h)-f(x)\\e- 2m ^\dx 

h^nnjty h—Hnjty I 



/lim \f(x + h) — f(x)\dx 
h^finfty 



= 0. 

(c) Let Q = [0, l] d . By|Tonelli's theorem 



1 d 



X Q (0 = / X Q (x)e- 2 ^ x dx=H e~ 2 ™^ dx n = \[ — 
J „-i Jo „_•, 



ra=l" u n=l s 

which tends to zero as |£| — >■ oo. By linearity, the Riemann-Lebesgue lemma 
holds for all simple functions over cubes. Given a general integrable function 
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/ on M. d , we can invoke Theorem 1.13 to find a simple function s e over cubes 
corresponding to each e > 0, satisfying the estimate 

ll/-/«||i<e- 

Since / e (£) — > as |£| — >■ oo, we can find a constant M such that |/ e (0l < £ 
for all |f | > M. It then follows that 

1/(01 < I&0I + f\f{x)-f e (x)\\e-W\dx 



= l/«(0l + ll/-/ e |l« 

< 2e 

for all |f | > M, whence f(g) ^ as |f | ^ oo. □ 

The Fourier transform behaves well under a number of symmetry opera- 
tions in the Euclidean space. The proof of the following proposition consists 
of trivial computations and is thus omitted. 

Proposition 1.29. Let f e L 1 (M d ) and r e R d . 

(a) & turns translation into rotation: if r^fix) = f(x — h), then 

nJ(0 = e~ 2mh <f(0. 

(b) & turns rotation into translation: if Chf{x) = e 2mx ' h f(x), then 

57(0 = ^/(0- 

(c) & commutes with reflection: if f(x) = f(—x), then 

ho = ho- 

(d) & scales nicely under dilation: if we set 6 a f(x) = f(ax) for each a > 0, 
then 

0(0 = a>-%-if(0 = a~ d f(a- l () 
for all a > 0. □ 

The Fourier transform also behaves quite nicely under differentiation. 
Indeed, the Fourier transform turns differentiation into multiplication by a 
polynomial. 
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Proposition 1.30. Let f G L 1 (IR' i ) and suppose that x n f{x\, . . . , x n , . . . , Xd) 

is an L 1 function as well. Then . . . , £ n , . . . , Xd) is continuously differ- 
entiable with respect to £ n and 

^-/(0 = ^(-27rix n /(x))(0. 
More generally, if P is a polynomial in d variables, then 
P(D)f(£) = &(P(-2mx)f(x)m and &(P(D)f)(0 = P(2m0f(0- 

Proof. Let h = (0, . . . , h n , . . . , 0) be a nonzero vector along the nth coordi- 
nate axis. By Proposition 1.29 (ii) and the dominated convergence theorem, 
we have 

lim He+y-hQ = Um , ((L^i) m ) K) 

= J?(-2mx n f(x))(0, 

as was to be shown. The second assertion now follows from linearity of the 
differential operator. □ 

To rid ourselves of technical issues that arise in dealing with non-smooth 
functions, it will be convenient to work in a space of smooth functions that 
behaves well under the key operations in harmonic analysis. Certainly, we 
would like the space to be closed under the Fourier transform. Proposition 



1.30 then implies that the space must be closed under multiplication by 



polynomials as well. We are thus led to the following definition, named after 
Laurent Schwartz: 

Definition 1.31. The Schwartz space J^(R d ) consists of functions / G 
< ^ 0O (]R rf ) with the decay condition 

sup \x a D /3 f(x)\ < oo 



for each pair of multi-indices a and /3. 
We remark that 

< ^ 6 °° (R d ) C y{R d ) C L p (R d ) 
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mollifiers, J^(R ) is nonempty. 



for all 1 < p < oo. Since ^^(R ) contains 
In fact, ^(R d ) is dense in L p (R d ), whence so is y(R d ). 

An equivalent definition for a Schwartz function is a function / E ^°°(R d ) 
that satisfies the growth condition 

sup(x) n |D Q ,/(x)| < oo 



for all natural numbers n and multi-indices a, where 



(x 



) = v / TT 



.r- 



As noted, the Schwartz space is closed under the action of the Fourier 



transform. This basic fact is an immediate corollary of Proposition 1.30 
Proposition 1.32. If f E y{R d ), then f e y{R d ). 



□ 



The Schwartz space behaves well under other important operations in 



harmonic analysis as well. We shall take up this matter in ^2.3 



We now turn to one of the fundamental questions in classical Fourier 
analysis: given the Fourier transform of a function, can we find the function 
itself? We begin with a useful proposition that allows us to "push the hat 
around" : 

Proposition 1.33 (Multiplication formula). If f,g E /^(IR^), then 



f{t)g{t)dt= / f(t)g(t)dt 



Proof. By Fubini's theorem 



f(t)g(t)dt 



f(x)e- Zmt - x dx g(t)dt 



g(t)e- 2nlpx dt f(x) dx 



g(x)f(x)dx 
f(t)g(t) dt, 



as desired. 



□ 
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We shall also need the following computation: 
Proposition 1.34. For all e > 0, we have 

This, in particular shows that the Fourier transform of the Gaussian 

r(x) = e-^ 2 

is the Gaussian itself. 
Proof. Recall that 

/ e x dx = y/n. 

J —OO 

We first consider the one-dimensional case. In fact, we fix positive real num- 
bers p and q and compute a more general integral: 



oo 

e -px* e - qi x dx = I e - P ( X *+f X ) dx 
■oo 

OO , 2 

e P(X+ 2P> 4p g] X 

-OO 

/•oo 

e -9 2 /4 P / 6 -p(-+§) 2 



oo 

e -<7 2 /4 P / e -(^) 2 



-oo 

-q 2 /Ap poo 



= e -<7 2 /4 P I 71 



e 



p 

Plugging in p = en and q = 27r£, we have 
whenever £ 6l. 
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It now suffices to observe that 



(0 



d „ 

n=l J 

d 

H 3? (e—^) (£ n ) 



n=l 
d 

n 

n=l 



£ -l/2 e -e- 1 7r^ 



£ -d/2 g - E -^|5| 2 



as desired. 



□ 



We now present a preliminary solution to the inversion problem, which is 
sufficient for the present thesis. A more detailed discussion can be found in 



£2.7.6 



Theorem 1.35 (Fourier inversion theorem). Iff G L 1 (M d ) and f G L 1 ^) 
then 

/(*) = / /(Oe^-de 

for almost every x G M d . 



By Proposition 1.32, Schwartz functions satisfy the hypothesis of the 



above theorem. In general, Proposition [L28] indicates that / must necessarily 
be a Cq map, although this is not a sufficient condition. 



Proof of Theorem 1.35 We consider the following modification of the inver- 
sion theorem: 



Since / G L (Mr), the dominated convergence theorem implies that 



lim / f(0e 2m ^ x dC 
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We now set g s (£) = e ^ e2 \i\ 2 e 2mT - x f or a fixed r. By the multiplication for- 



mula, we have 



I £ (x) = J f(t)g £ (t)dt = J f(t)g £ (t)dt. 
Proposition 1.29| (b) and Proposition 1.34 imply that 



9e(t) 



J? (e-™ 2 l^l 2 j it) 
-1, 



Setting T £ (s) 



= e- d V{e-\r-t)). 
e~ d T(e~ 1 s), we see that 



I £ (s)= I f(t)T £ (s-t)dt = (f*T £ )(s). 



(r £ ) £>0 is an approximation to the identity, and so 

lim||/ s -/|| 1 = lim||/*r e -/|| 1 = 0. 



£->0 



£->0 



It follows that (J e ) e >o converges to f f(^)e 27ri ^' x d£ pointwise and to / in L 1 , 
whence 

/(*) = I KO^dt, 



as was to be shown. 



□ 



We often write f v to denote the inverse Fourier transform of /. Note that 

f{x) = f{-x). 



The inversion formula, combined with Proposition 1.32 implies that the 
Fourier transform operator maps J^(R ) onto itself, with the inverse 



&- l {f)(x) = &(f)(-x). 

Since & is also linear, we see that & is a linear automorphism of o^(lR d ). We 
shall see that & also preserves the natural topological structure on J?^(R ), 
hence turning & into a Frechet-space automorphism. Frechet spaces are 



discussed in §2.1[ topological properties of the Schwartz space are discussed 
in SI 
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1.3.2 The L 2 Theory 

We now recall that L 2 (IR d ) is a Hilbert space, with the inner product 

(f,9h = J fS- 

Since y(M, d ) is a linear subspace of L 2 (IR d ), it inherits the inner product 
as well. As it turns out, the Fourier transform operator preserves the inner 
product: 

Lemma 1.36 (Plancherel, Schwartz-space version). & : 5f(^k d ) — > ^(R d ) 
is a unitary operator. In other words, if<p,<fiE ,y(R d ), then 

(f,9h = (f,gh- 

In particular, \\<p\\2 = 1Mb- 



Proof. By the Fourier inversion formula and Proposition 1.29 



ip{t)(j>{t) dt = 0(-t)(j)(t) dt = 0(-t)(j)(-t) dt 



and so 



It now follow from the [multiplication formula that 



(<P, 0)2 = J ¥<P = J V<P = j ¥<P= (<p, 0)2, 
as desired. □ 



Could we do better? By Theorem 1.11 the Fourier transform operator 



J^, defined on the dense subspace J^(R d ) of L 2 (IR a! ), admits a unique norm 
preserving extension on L 2 (M. d ). We call this extension the I? Fourier trans- 
form and denote it by as well. The norm-preserving property implies 
that the L 2 Fourier transform is an isometry into itself. Since an isometric 
operator on a Hilbert space is also a unitary operator, the Fourier trans- 
form preserves the L 2 -inner product as well. We summarize the foregoing 
discussion in the following theorem: 
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Theorem 1.37 (Plancherel). The L 2 Fourier transform & is a unitary au- 
tomorphism on L 2 (M. d ). In other words, the L 2 Fourier transform is linear, 
maps L 2 (R d ) onto itself, and preserves the inner-product structure of L 2 (M. d ). 
Furthermore, the L 2 Fourier transform on L 1 ^) n L 2 (R d ) agrees with the 
L 1 transform. 

Proof. The linearity of & : L 2 (IR d ) — » L 2 (IR d ) has already been established 



by Theorem 1.11 Since is dense in both L x (]R d ) and L 2 (IR d ), the unique- 



ness clause in Proposition 1.11 also guarantees that the L 1 and L 2 Fourier 
transforms must agree on L 1 R L 2 . 

We claim that ^(L 2 {R d )) is closed and dense. Indeed, if (/ n )£li is a 
sequence of functions in JF(L 2 (M. d )) that converges to / G L 2 (IR d ), then we 
can find a sequence (g n )™=i of functions in L 2 {R d ) such that g n = f n for 
each integer n. Since is an isometry, (g n )^=x is Cauchy in L 2 (M. d ), hence 
converges to g G L 2 (M. d ). Of course, g = f, and the range is closed. To 
establish the density of j£"(L 2 (IR d )) in L 2 (IR d ), it suffices to observe that 

,y(m. d ) = ^(<y(R d )) c ^(L 2 (R d )). 

This proves the claim, and it now follows that J^(L 2 (IR d )) = L 2 (R. d ). For each 
f,g G L 2 (IR d ), we invoke the polarization identity of inner-product spaces to 
see that 

(f,9h = i(||/ + ^|| a -||/-^|| a +»||/ + ^||2-»||/-v|| a ) 
= \(\\f^9h-\\f^9\\2 + i\\f + Tg\\ 2 -i\\f^gh) 
= i (ll/ + 9h - 11/ - 0II2 + «'||/ + ^|| 2 - i\\f - i»|| 2 ) 

= (/,^)2, 

whence & is a unitary automorphism on L 2 (IR d ). □ 



1.3.3 The L p Theory 

Thus far, we have seen that the Fourier transform can be defined on L 1 (R d ) 
and L 2 (M d ). In the final subsection of this section, we shall extend the Fourier 
transform operator onto other LP spaces. To this end, we establish our first 
interpolation result: 
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Proposition 1.38. Let (X,9Jl,fi) be a measure space. If\<p<r<q< oo, 
then U (X, fi) C (L p + L q ) (X, fi) . 

Proof. Let / G L r (X, /i), set = {a; : > 1}, and define g = /xe 

and /i = Note that \g\ p = \ f\ p XE < \f\ r XE, and so g G L p (X,p). If 

g < oo, then \h\ p = \ f\ p Xx^E < \f\ r Xx^E, and so h G L q (X,^). If g = oo, 
then 1 1 /i 1 1 oo < 1, and so h G L q (X,fi). It thus follows that 

/ = + /i 

is in (L p + L g )(X,/i). □ 

In view of the above proposition, we extend the domain of Fourier trans- 
form to all L p (M. d ) for 1 < p < 2 by defining the L 1 + L? Fourier transform. 
Indeed, we set 

f = 9 + h 

for each / G L p (R d ), where g G L x (R d ) and h G L 2 (R d ). The decomposition 
is not unique, of course, but the L 1 + L 2 Fourier transform is nevertheless 
well-defined. Indeed, g\ + h\ — g<z + h 2 implies that g\ — g 2 = h 2 — hi is in 
L 1 (IR d ) flL 2 (IR d ). Since the L l and L 2 Fourier transforms coincide on L 1 flL 2 , 
it follows that gl — g 2 = h 2 — hi, or 

gi + h 1 = g 2 + h 2 . 

We can now restrict the L 1 + L 2 Fourier transform operator onto each L p (WL d ) 
to define the LP Fourier transform. 

Alternatively, we could use the density of ^(IR ') to extend the Fourier 
transform on S^(M. d ) onto LPiW. ), as we shall show below that the Fourier 
transform extends to a bounded operator. Since the L 1 + L 2 definition of the 
L p Fourier transform must agree with the usual Fourier transform on <y(M. d ), 



Theorem 1.11 implies that these two definitions coincide. 



Implicit in the above argument is the second conclusion of PlanchereFs 



theorem which asserts that the L l Fourier transform and the L 2 Fourier 
transform agree on L 1 fl L 2 . In fact, it is possible to carry out the argument 
directly on the intersection, as the next proposition shows. 

Proposition 1.39. Let (X,Wl,p,) be a measure space. If\<p<r<q< oo, 
then L p (X, p) n L q (X, p) C L r (X, fi) and 



r < (1-6) 
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for all f G L p (X,/i) D L q (X,/j,), where 9 is the unique real number in (0, 1) 
satisfying the identity 

r' 1 = (1 -9)p- 1 + 9q- 1 . (1.7) 

Proof. If q = oo, then \f\ r = \f\P\f\ r -*> < \f\ p \\f\\ r - p . With 6 = 1-p/r, we 
have 

ll/||r<||/|g /r ||/||^ /r =||/||J-"" "* 
If q < oo, then we observe that 

1 1 

1 = —rr. 7^ + 



p/ (1 — 9)r q/9r ' 
whence we may apply Holder's inequality: 

\f\ r dfi 

j Ifl^V^dfi 

< 



(l-0)r \lr\\0r 
p/(l-0)r\\J Wq/9r 

\ {l-e)r/p , r N Or , 

\f\ p d») [J \f\ q d^ 

{ t 6)r \\K- 

Taking the rth roots, we obtain the desired inequality □ 

The above proposition also suggests what we should expect the codomain 
of the LP Fourier transform to be. The L l Fourier transform maps into L°°, 
and the L 2 Fourier transform maps into L 2 . For any given 1 < p < 2, then 
we might expect the LP Fourier transform to map into L p , as the constant 9 



that satisfies the identity (1.7) 



1 ' 2' 

with 1 and 2 plugged in for the L 1 and L 2 Fourier transforms, yields 

1-9 9 



oo 
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when we plug in 2 and oo, as per the orders of the the target spaces for the 
L 1 and L 2 Fourier transforms. Following this line of reasoning, we could also 



conjecture that the norm estimate (1.6) holds for operators as well, which, 
in this case, implies that 

for all 1 < p < 2. This, in fact, turns out to be true. 



Proposition 1.39 and the conjectured inequality (1.8) are special cases of 



our first main theorem of the thesis, the Riesz-Thorin interpolation theorem 



We shall study the theorem and its consequences in the next section. For now, 
we shall apply our newly established L p Fourier transform to convolutions 
and study their behaviors. First, we establish a preliminary result: 

Proposition 1.40 (Convolution theorem, L l version). If f,g G L x (]R d ), then 
f *9 = fa- 



Proof. Young's inequality shows that / * g G L 1 (IR c( ), so we can make sense 



of the Fourier transform of /* g. The proposition is now an easy consequence 
of Fubini's theorem and Proposition 1.29[ a): 



f*g(0 



f{x - y)g(y) dy 



dx 



J (/ f(x-y)e- 2 ^ x dx^g(y)dy 



f(09(y)e- 2my< dy 

mm- 



□ 



Since the Fourier transform is linear, the result extends easily to the LP 
case. The main idea is to consider the convolution operator as a bounded 



operator. We shall have more to say about this in ^2.3 See also Theorem 



1.46 in the next section for another example of a convolution operator. 



Theorem 1.41 (Convolution theorem, LP version). Let 1 < p < 2. // 

/ G L p (R d ) and g G L 1 (R d ), then jT~g = fg. 
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Proof. Fix g G L 1 (R d ) and 1 



inequality shows that f*g G L p ( 



< p 

rati' 



< 2. For each / G L p ( 



Young's 



so we can apply the LP Fourier transform 
to / * g. Consider the "convolution operator" T : L p (R d ) — > L p (R d ) defined 
by 

Tf = f*g. 



By Young's inequality, T is bounded with operator norm at most 
Therefore, the operator T x = &T : L p (R d ) -> L p ' (R d ) is bounded as well. 



Proposition 1.40 now implies that 



Tif = f*g = fg 



1.9) 



for all / G ,5^(lR d ). S^iW 1 ) is dense in L p (R. d ), and & is continuous, hence 



the uniqueness statemente in Theorem 1.11 guarantees that (1.9) holds for 
all / G L p (R d ). □ 

What can be say about the L p Fourier transform for p > 2? In order to 



1.11 



we must 
it is not 



extend L l Fourier transform on y{R, d ) to L p (JsL d ) via Theorem 
have a Banach space as a codomian. As it stands now, however 
at all clear what the target space should be. In fact, the Fourier transform 
of an IP function may not even be a function but a tempered distribution, 



which we shall define in £2.3 
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1.4 Interpolation on L p Spaces 

We now come to the first major theorem of this thesis. We have seen in the 
last section that the Fourier transform, defined as a bounded operator on 
(L 1 + L 2 )(R d ) into + L 2 )(R d ), can be "interpolated" to yield a bounded 
operator on L p (R d ) into L p '(R d ). We shall see that a general theorem of the 
kind holds. Specifically, if an operator is "defined" on both L po and L P1 and 
maps boundedly into L qo and L qi , respectively, then we shall prove that the 
operator can be interpolated to yield a bounded operator on L Pe into L ge , 
where pe and qe are appropriately defined intermediate exponents. 



1.4.1 The Riesz-Thorin Interpolation Theorem 

To state the theorem, we first need to make sense of an operator defined 



on two separate domains. Taking a cue from Theorem 1.11 , we define our 



operator on a dense subset of each Lebesgue space in question. 

Definition 1.42. Let (X, fi) and (Y, v) be cx-finite measure spaces. We fix a 
vector space D of /z-measurable complex-valued functions on X that contains 
the simple functions with finite-measure support. We also assume that D is 
closed under truncation, viz., if / 6 D, then the function 




if n < \f(x)\ < r 2 ; 
otherwise; 



defined for each < r x < r 2 , is also in D. Given 1 < p,q < 
that a linear operator T on D into the vector space Ai (Y, fi) of ^-measurable 
complex- valued functions on Y is of type (p, q) if there exists a constant k > 
such that 

\\Tf\\ q <k\\f\\ p 

for all / G D fl L P (X, y). The infimum of all such k is referred to as the (p, q) 
norm of T and is denoted by ||T|| iP ^ L 9. 

Let T be an operator of type (p , q ). We first remark that we can restrict 
the codomain of T : DnL Po (X, //) -> M(Y, v) to L qo (Y, fi), which is a Banach 



space. Since Proposition 1.12 implies that Dr\L P0 (X, fi) is dense in L P0 (X } fi), 
we may invoke Theorem 1 1.11 to construct a unique norm-preserving extension 
T : L Po (X, fi) — > L qo (Y, v). If, in addition, T is of type (pi, q\), then a similar 
argument yields the extension T : L Pl (X,fi) — > L qi (Y, u). 

We now state the interpolation theorem, due to M. Riesz and O. Thorin: 
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Theorem 1.43 (Riesz-Thorin interpolation). Let 1 < Po,Pi, qo, Qi < oo. If 

T is a linear operator simultaneously of type (po, qo) and of type (j>x, q\), then 
T is of type (pe, qe) with the norm estimate 

1 1 T 1 1 < --' 1 1 T" 1 111 — II F~T~\ 1 1 

W-L \\LP0-+Lie ^ || J Wlpo^LioW 1 Wlpi^LH 

for each 6 e [0, 1], where 

p- 1 = (l-9)p 1 + 9p^ 1 ; 
q e l = (l-e)qo 1 + Bq[\ 

We remark that the interpolation result can be described pictorially in a 
so-called Riesz diagram of T, which is the collection of all points (1/p, in 
the unit square such that T is of type (p, q). In this context, the above the- 
orem implies that the Riesz diagram of a linear operator is a convex set: for 
any two points in the Riesz diagram, the Riesz-Thorin interpolation theorem 
guarantees that the line connecting them is also in the Riesz diagram. 




0.2 0.4 0.6 0.8 1.0 



Figure 1.4: A Riesz diagram 

The interpolation theorem was originally stated by M. Riesz in |Rie27bJ. 
In the paper, the theorem was stated only for the lower triangle in the Riesz 
diagram, i.e., for p n < q n . Since the proof of the theorem made use of 
convexity results for bilinear forms, the interpolation theorem is often referred 
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to as the Riesz convexity theorem. The extension of the theorem to the 
entire square is due to 0. Thorin, originally published in his 1938 paper and 
explicated further in |Tho48] . The modern proof of the theorem was first 
provided by J. Tamarkin and A. Zygmund in |TZ44] and makes use of an 
extension of the maximum modulus principle from complex analysis: 

Theorem 1.44 (Hadamard's three-lines theorem). Let $ be a holomorphic 
function in the interior of the closed strip 

S = {z E C : < Rez < 1} 

and bounded and continuous on S. If \&(z)\ < ko on the line Imz = and 
\&(z)\ < ki on the line Imz = 1, then, for each < 9 < 1, we have the 
inequality 

|$(z)| < k\- e k\ 

on the line Imz = 9. 

Proof. We define holomorphic functions 

= and Hf n (z) = ^{z)e^/\ 

so that ty n — > ^ as n — > oo. We wish to show that |^(^)| < 1 on S. 

To this end, we first note that ^(z)! < 1 on the lines Imz = and 
Imz = 1. Moreover, $ is bounded above on S, and k$~ z kf is bounded below 
on S, hence we have the bound ^(z)! < M on S. Noting the inequality 

\^n(x + iy)\< Me- y2/n e {x2 - 1)/n < Me^ 2/ ", 

we see that ^f n (x+iy) converges uniformly to as \y\ — > oo. We can therefore 
find y n > such that \^ n (x + iy)\ < 1 for all \y\ > y n and < x < 1, whence 
by the maximum modulus principle we have |\l/ n (z)| < 1 on the rectangle 

{z = x + iy : < x < 1 and — y n < y < y n }. 

It follows that |\P n (z)| < 1 on S for each n G N, and sending n — > oo yields 
the desired result. □ 

We now come to the proof of the interpolation theorem. In the words 
of C. Fefferman in [Fef95] , the proof essentially amounts to providing an 
estimate for the integral 

J(Tf)gdu, 
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where / G L Pe (X,/j) and g G L q (>(Y,is). In light of the Riesz representation 



theorem, the supremum of the above expression equals ||T/|| 9e . To give 



an upper bound for the above integral, we begin by finding nonnegative 
real numbers F and G and real numbers <ft and ip such that / = Fe 1 ^ and 
g = Ge 1 ^, and consider the entire function 



(Tf z )g z dp, 



Y 



where f z = F az+h e l< t> an d g z = Q cz + d e 1 ^ for suitably chosen real numbers 
a, b, c, d. We pick a, b, c, d such that 



\f z \ P0 = l/r and \g z 



on the line Re z = 0, 



\f z \ p ' = |/r and \g z 



on the line Kez = 1, and 

f g = / and g z = g 

at the point z = 9. 

The first assumption yields an upper bound k' for both ||/ 2 || po and ||g z ||^ 
on the line Re z = 1, and so the L po — > L qo norm inequality of T yields the 
estimate 

\®(z)\ <k 

on the line Rez = for some constant ko- Similarly, the second assumption 
furnishes a constant k\ such that 



\®{z)\ <h 



on the line Re z = 1 . By Hadamard's three-lines theorem , we now have the 
bound \<&(z)\ < kl~ e k®, and the third assumption implies that 



(Tf)gdu 



Y 



We now present the proof in full detail. 
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Proof of Theorem \1.4S[ Fix < 9 < 1. For notational convenience, we let 

1 „ 1 



1 1 

«o = — j a i = — j a 
Po Pi 



— > A) — — > Pi — — > P 
Pe <?o qi 



With this notation, we set 

a(z) = (1 — z)a + zai and j3{z) = (1 — z)(3 + z(3i, 

so that 

a(0) = a , a(l) = ai, a(0) = a, /3(0) = /3 , /9(1) = /3i, P(9) = 0. 
We shall first prove the theorem for simple functions with finite-measure 



support, which evidently belong to DC\L PB (X, //). By the Riesz representation 

{Tf)gdu 



theorem, we have 



\\Tf\\ 



'In 



sup 



y 



for each simple function /, where the supremum is taken over all simple 
functions g of L 9 e-norm at most one. Therefore, it suffices to show that 



(Tf)gdu 



Y 



pa 



for each such g. If 



i'ii 



0, then there is nothing to prove, and so we can 



assume by renormalization of / that 



Po 



1. We thus set out to establish 



(Tf)gdu 



Y 



< h l - e h d 



for all simple functions g with ||<7|| g ' = 1. 
We now suppose that 



/ = Yl a i xE i and 9 = &fcXF fc 



fc=i 



are two simple functions satisfying the above conditions. We also assume 
without loss of generality that pe < oo and qo > 1, so that a > and /3 < 1. 
Write tij = \aj\e %d] and = l&fcle 1 ^ and set 



/* = E \^\ a{z)/a ^X Ej and <7« = ^ l&fc| (1 ^ W)/(1 ^ } e^XF fc 



k=l 
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for each zGC Then 



(Tf z )g z du 



is an entire function such that 



(Tf)gdu. 



Y 



We note, in particular, that $ is holomorphic in the interior of S and is 
continuous on S. Since T is linear, we see that 

3=1 k=l ^ ^ Y 

whence F is bounded on S. 

We now furnish a bound for $ on the lines Kez = and Kez = 1. 
Note first that \&(z)\ < H^/zllgollS'zll^ by Holder's inequality. Observing the 
identities a; (2) = ao + z{a.\ — «o) and 1 — (3(z) = (1 — /3 ) — z(/3i — /?o) on the 
line Re z = 0, we see that 



I/, 



IPO 



iarg/| f\z(ai-a )/a\ f\pg/po\po 



l/l 

iarg fl y-^/3 1 -A))/(l-/8)| 



l/l 



9e/<?o|9o 



Since T is of type (p , q ), it thus follows that 

mz)\ < \\Tf z \\ qo \\ gz \\ q , 
< M/*Ul0*IU 



\f\ p dfi 



X 



1/po 



\g\ % du 



Y 



1/90 



< ko\\f\\ p p e/po \\g\\f /q '° 

< k Q 

on the line Kez = 0. A similar computation establishes the bound |$(^)| < 
k\ on the line Re z = 1 , whence by Hadamard's three-lines theorem| we have 
the inequality 

on the line Kez = 9. Setting z = 8,we have 



(Tf)gdv 



1$ 
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which is the desired inequality. 

Having established the theorem for simple functions, we now prove the 
theorem for the general function / G D fl L p (X,fi). To this end, we shall 
furnish a sequence (f n )^=i of simple functions such that 

hm ||/ n - /|U = and lim (Tf n )(x) = (Tf)(x), 
for then Fatou's lemma yields the inequality 

||T/|| W < hm ||T/ B || W < hm A^II/nlL = A^Xll/IU- 



Therefore, the task of proving the theorem reduces to finding such a sequence. 

We assume without loss of generality that / > and po < p±. Let f° be 
the truncation of /, defined as 



f(x) if /(./ ) > 1: 
if f(x) < 1; 



and f 1 — f — f° another truncation. D contains all truncations of /, and 
(f°) po and (J 1 )? 1 are bounded by f p , hence f°eDn L P0 (X,/j,) and f 1 E 
D fl L Pl (X,fi). We can find a monotonically increasing sequence (g m )m=i 
converging to /, which satisfies 



lim \\g m - f\\ pg = 

m— >oo 

by the monotone convergence theorem. If and g^ are truncations of g m 
defined in the same way as f° and f 1 , then we have 

lim \\g° m - f°\\ Pe = lim Wglf 1 ^ = 0. 

Since T is of types (po,qo) and (pi,qi), we have 

lim \\T S ^-Tf\\ qo = lim \\Tg l m -Tf 1 \\ 9l = 0. 

m->oo m— ^oo 

We can then find a subsequence of (Tg^)^ =1 converging almost everywhere 
to Tf°, whence we may as well assume that the full sequence converges 
almost everywhere to Tf°. Similarly, we can find a subsequence (^ m „)^=i of 
(<7m)m=i such that (Tg^ )^ =1 converges almost everywhere to Tf 1 , whence 
the sequence (f n )^ =1 defined by setting 



fn — g mn + g mn 



is the desired sequence. This completes the proof of the Riesz-Thorin inter- 
polation theorem. □ 
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1.4.2 Corollaries of the Interpolation Theorem 

We now recall the Hausdorff- Young inequality from the last section: 

Theorem 1.45 (Hausdorff- Young inequality). For each 1 < p < 2, the LP 

Fourier transform is a bounded linear operator from L p (M> d ) into LP (IR d ). 
Specifically, we have the inequality 



v < 



pi 



whence the LP — > L p operator norm of ^ is at most 1. 

The inequality is now a trivial consequence of the interpolation theorem, 
for the Fourier transform is a bounded operator from L l into L°° and from 
L 2 into L 2 , whose operator norm is at most 1 in both cases. 

Another application is the following generalization of Young's inequality) 



Theorem 1.46 (Young's inequality). Lf 1 < p, q, r < oo such that 



1 1 1 

-+-=1+- 
p q r 



then 



\\f*9\\r< 

for all f e LP(R d ) and g G L g (R d ). 



Again, the proof is a routine application of the interpolation theorem on 
the convolution operator 

Tg = f*g, 



once we recall the Young's inequality to establish that T is of type (l,p) and 
invoke the bound 



ll/*0l|oo < 



\9\ 



from Theorem 1.26 to prove that T is of type (p' , oo). We shall have more 



to say about convolution operators in ^2.3 The constants in the above 



inequalities can be improved: see §£1.5.8 for the sharp versions. 
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1.4.3 The Stein Interpolation Theorem 

Let (X, 971, /i) and (Y, 91, v) be a-finite measure spaces and T a linear oper- 



ator of types (po-,%) and (pi,qi). Recall that the proof of the Riesz-Thorin 



function 



interpolation theorem essentially amounted to giving an estimate of the entire 



What if we let the operator T vary as well? C. Fefferman points out in |Fef95] 
that the net result of adding a letter z to the operator T is the new entire 
function 

= J (TJ z )g z dv, 

whence establishing an estimate of $ produces another interpolation theorem. 
The similarity of the operators suggest that the new proof should closely 



mimic that of the Riesz-Thorin interpolation theorem, and this is indeed the 
case. 

We thus obtain an interpolation theorem that allows the operator to vary 
in a holomorphic manner, as the letter z suggests. The interpolation theorem 
was first established by E. Stein in |Ste56j and is dubbed interpolation of 
analytic families of operators in the standard reference |SW71] of E. Stein and 
G. Weiss. In the present thesis, we shall refer to it as the Stein interpolation 
theorem. 

In order for $ to be holomorphic, we must impose a restriction on how 
the operator can vary: 

Definition 1.47. Let (X, 971, /x) and (Y, 01, v) be a-finite measure spaces and 



S= {z G C : < Rez < 1}. 



We suppose that we are given a linear operator T z , for each z G S, on the 
space of simple functions in L l (M,fi) into the space of measurable func- 
tions on N. If / is a simple function in L l (M,fi) and g a simple function 
in L 1 (A^, u), we assume furthermore that (T z f)g G ^(N, v). The family 
{T z }zes of operators is said to be admissible if, for each such / and g, the 
map 

z / (T z f)gdu 
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is holomorphic in the interior of S and continuous on S, and if there exists a 
constant k < tt such that 



sup e" fc|Im21 



(T z f)g dv 



N 



< oo. 



With this hypothesis, we can state the interpolation theorem as follows: 

Theorem 1.48 (Stein interpolation theorem). Let (X, 9Jt, p) and (Y, 91, v) be 

a-finite measure spaces and {T z } zGS an admissible family of linear operators. 
Fix 1 < po,pi, qo, qi < oo and assume, for each real number y, that there are 
constants Mo(y) and M\(y) such that 

HWIU < M (y)\\f\\ P0 and \\T 1+iy f\\ qi < M 1 (y)\\f\\ Pl 

for each simple function f in L l {M,p). If, in addition, the constants Mj(y) 
satisfy 

sup e~ %l log Mj(y) < oo 

— oo<y<oo 

for some k < tt, then each 9 G [0, 1] furnishes a constant M e such that 



\\Tof\\q e < Me\\j i ipe 
for each simple function f in L l (M,p), where 

p- 1 = (l- 0)00* + Apr 1 ; 

q e l = (1 - 0)q? + Oq?. 



Theorem 1.11 then implies that Tq can be extended to a bounded oper- 
ator on L Pe (M. d ) into L"(IR d ). For the sake of convenience, we shall use the 
term operator of type (pe, q$) for Tg as well. The proof of the interpolation 
theorem makes use of an extension of IHadamard's three-lines thcorcml due 
to I. Hirschman: 

Lemma 1.49 (Hirschman). If $ is a continuous function on the strip S that 
is holomorphic in the interior of S and satisfies the bound 

supe- fe|Im2| log|$(^)| < oo (1.10) 
for some constant k < tt, then 

logimi <^ir + iy (1 . u) 

2 J_ 00 cosh Try — cos ttU cosh iry + cos uu 

for allO E (0,1). 



1.4. INTERPOLATION ON L p SPACES 



63 



Why is this an extension of Hadamard's three-lines theorem? If $ is 
bounded and continuous on S, \<&{z)\ < ko on the line Imz = and |3>(z)| ^ 
k\ on the line Imz = 1, then the bound 

supe- fc|Imz| log|$(z)| < oo 

zes 

is satisfied for k = 0. Furthermore, we have 

i r 'og dy = e and i r jggwi+jOL „ = l _ 9i 



2 J_ oa cosh 7ry — cos tt9 2 cosh 7r?/ + cos nO 

whence Hirschman's lemma implies that 

\<&{x)\ < kl- e k[. 

We have thus recovered the three-lines theorem from Hirschman's lemma 



Proof of Lemma \l-4$\ Let D be the closed unit disk in C. For each £ in 
D \ { — 1, 1}, we define 

<p{Q = —. iog(V 



ni V 1 - C 
<p is a composition of the conformal mapping 

.i + C 



c 



of .D \ { — 1,1} onto the closed upper-half plane EI and of the conformal 
mapping 

1 , 

w z = — log w 
ixi 

of EI onto the closed strip S. Therefore, h maps D \ { — 1,1} conformally 
onto S, and the inverse 

C-^\z) _,. . 

e + % 

is conformal as well. ^ = $ o ip is holomorphic on open unit disk and is 
continuous on D\ {-1,1}. 

We now recall the following standard result)^] from complex analysis: 

3 See pages 206-208 of [Ahl79j for a detailed discussion 
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Lemma 1.50 (Poisson- Jensen formula). Let ^ be a holomorphic function 
on an open disk of radius R centered at 0. If, for some < p < R, we write 
ax, . . . , ajy to denote the zeroes of \l/ in the open disk \z\ < p, then 



N 



log |^0) 



5> 

n=l 



p - a n z 



p{z - a n ) 



+ 



2tt 



27r nP ie _|_ 7 

Re^^log|/(pe*>)|d0 
pe w — z 



for all \z\ < r such that f(z) ^ 0. 



For < p < i? < 1, we can write ( = pe lw and apply the above lemma to 
obtain 



log |*(C)| < 



R 2 -p 2 



2ir J_ w R 2 -2Rpcos(oj 



log |^(^)| 



;i.i2) 



Rewriting (1.10) in terms of \l/ and i] = ip 1 (x + iy), we have 



log Itffa) | <C\\1 + T] 



-k/TT 



ii-^r fe/7r ] 



for some constant C independent of r] G D. Plugging in 77 = Re 1 ^ and 
noting that k/ir < 1, we see that the above inequality permits us to use 
the dominated convergence theorem to the integral in (1.12). Therefore, by 
sending R — >• 1, we obtain 



log M0\ < 



2n J_ w 1 — 2p cos(u — (p) + p 2 



log 



;i.i3) 



We now apply a change of variables to (1.13) inequality to obtain ( |1.11[ ). 
First, we convert the condition 



< u = <p(pe iul ) < 1 
into a condition on p and u. Indeed, we note that 



;i.u) 



pe' 



tp-\0) 



COS7Tt 



+ i 



1 + sin tt9 



COS 7Tt 



sin7r# 



>/2 



whence (1.14) implies that 
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Therefore, if < 9 < |, then 

1 - p 2 1-p 2 



1 - 2pcos(a; - <p) + p 2 1 - 2pcos(0 + tt/2) + p 2 

= 1-P 2 

1 + 2psin0 + p 2 
sin 7r# 



1 + cos ti9 sin ( 



Furthermore, the above equality holds for | < < 1 as well. 
We now observe from the identity 



e" 7 ^ + z 

that y ranges from +oo to — oo as <p ranges from — 7r to 0. Moreover, 

1 7T 

sin0 = and dcj) = dy, 

cosh iry cosh ny 

and so the change-of- variables formula] yields 

1 f° 1 - n 2 

1 1 log|*(e^)|# 



2n J_ 7T 1 — 2pcos(w — 0) + p 2 
smn8 f 00 log \$(iy)\ 



cosh iry — cos ir9 



dy. 



Similarly, as <fi ranges from to 7r, the function <p(e^) produces the points 
1 + iy with — oo < y < oo, whence 

' 1 1_/r -log|^(e^)| 



27r y o 1 — 2pcos(a; — 0) + p : 
sinvrfl Z" 00 log|$(l + iy)| 



_ oc cosh 7ry + cos ii9 



dy. 



We add up the two quantities and plug the sum into (1.13) to obtain (1.11), 
which was the desired inequality. □ 

We are now ready to present a proof of the interpolation theorem. 
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Proof of Theorem \1.48[ Fix < 6 < 1. As in the proof of the Riesz-Thorin 



interpolation theorem[ we let 



"o = — , «i — — , a = — , fa = — , /3i = —, (3 = —. 
Po Pi Pe qo qi qe 

With this notation, we set 

a(z) = (1 — z)a + za.\ and f3{z) = (1 — z)/3q + z(3i, 

so that 

a(0) = a , a(l) = a lt a(0) = a, /5(0) = ft, 0(1) = P(9) = 13. 
Let 

m n 

f = E a i XE i and 9 = E &fcXF * 
j=i fc=i 

be simple functions such that / G L 1 (X,fi), g G L 1 (Y, u), and 

11/11* = i = bh- 

Write Oj = |%|e*^ and b k = \b k \e %lfk and set 

m m 

U = E M Q(2)/a ^ and fo = £ Iftfcl^WVti-^e^^ 
i=i fc=i 

for each z G C. Then 

$(z)= [ (T z f z )g z du 
is an entire function such that 



Y 



${6) = J(T e f)gdu. 

We note, in particular, that $ is holomorphic in the interior of S and is 
continuous on S. Since T z is linear, we see that 

j=l k=l \ Jy J 
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It follows that {T z } z( z S is an admissible family, whence $ satisfies the 
bound 

supe- fc|Imz| log|$(^)| < oo. 



Furthermore, we have 



\fiy\ P0 = l/r = \fi + iy\ Pl and \g iy \«'° = \g\4 = |«&- 



■iv\ 



\<h 



for all y £ R, and so Holder's inequality implies that < M Q (y) and 



|$(1 + iy)\ < Mi(y). Hirschman's lemma now establishes the bound 



< 



exp 



sm 7Tt 



log | | log|$(l + iy)| 

H ; : „ ay 



cosh rry — cos ttO cosh Try + cos 7r# 



whence we invoke the Riesz representation theorem to conclude that 



\Tef\V 



sup 

HsL' =1 



(Tof)g dv 



<Ma = Ma 



as was to be shown. 



□ 



See §£2.7.6 for an application of the Stein interpolation theorem in the 



context of the Fourier inversion problem. In £2.5, we shall prove Fefferman's 



generalization of the Stein interpolation theorem that allows us to take a 
particular subspace of L 1 as the domain for one of the endpoint estimates. 
The theory of complex interpolation, which provides an abstract framework 



for Riesz-Thorin, Stein, and Fefferman-Stein, will be developed in £2.2 
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1.5 Additional Remarks and Further Results 

In this section, we collect miscellaneous comments that provide further in- 
sights or extension of the material discussed in the chapter. No result in the 
main body of the thesis relies on the material presented herein. 

1.5.1. If X is a topological space, then a compactificatior^\oi X is defined to 
be a compact Hausdorff space Y such that X embeds into Y as a dense subset 
of Y. The extended real line K can be considered as a compactification of 
R, if, in addition to the open sets in M, we declare the intervals of the form 
[— oo, a) and (a, oo] as open sets in K. With this topology, an extended real- 
valued function / is measurable if and only if is measurable for every 
open set U in K, thus conforming to the standard definition of measurability. 



1.5.2. For 1 < p < oo, the Riesz representation theorem continues to hold 



on non-a-finite measure spaces: see Theorem 6.15 in |Fol 99j . It thus follows 
that L p spaces for 1 < p < oo are always reflexive Banach spaces. As for 
p — 1, the representation theorem continues to hold for a slightly milder 
condition that p, be decomposable: we say that p is decomposable if there is 
a pairwise disjoint collection of finite /^-measure such that 

(a) The union of all members of & is the base set X; 



(b) If E is of finite //-measure, then 



(c) If the intersection a subset E of X and each element of the collection & 
is //-measurable, then E is //-measurable. 

See Theorem 20.19 in |HS65j for a proof. 

The dual of L°° is strictly bigger than L 1 , even on the Euclidean space. 
We define a bounded linear functional Iq on C c {R d ) by setting 



W) = /(o) 



4 See §29 and §38 in jMunOOj or Proposition 4.36 and Theorem 4.57 in |Fol99] for 
standard methods of compactification in point-set topology. 
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for each / G C C (R ). By Theorem 2.9, there exists a bounded extension I of 



io on L°°(IR d ). Assume for a contradiction that 

/(/) = J fudx 

for some u G L 1 ^). Then / fu = for all / G C c (M d ) such that f(0) = 0, 
whence the discussion in §1.5.7| establishes that u = almost everywhere on 
M. d \ {0}. Therefore, w = almost everywhere on M d , and so 

/m dx = 

for all / G L°°(]R d ), which is evidently absurd. 

In general, the dual of L°° (X, 9Jt, /x) is isometrically isomorphic to the 
Banach space of finitely additive measures of bounded total variation that 
are absolutely continuous to /i, which jDS58j denotes as ba(X, 9JI, //i). See 
§IV.8.16 in [D S58] or §IV.9, Example 5 in [Yqs80] for a proof of this repre- 
sentation theorem. Another "representation theorem" can be established if 
we consider L°° ClS db C*-algebra: see §12.20 in |Rud91j . 

1.5.3. The Whitney decomposition theorem states that every nonempty 
closed set F in M. d admits a countable collection {Q n : n G N} of almost- 
disjoint cubes such that the union of the cubes is W 1 \ F and 

diamQ n < d(Q n ,F) < 4diam(Q n ) 

for each nGN. See § VI. 1.2 in |Ste70j or Appendix J in [Gra08aj for a proof. 
Compare the decomposition theorem with the C alder on- Zygmund lemma: if 
/ is a nonnegative integrable function on M. d and a a positive constant, then 
there exists a decomposition IR d = F U Q such that 

(a) F H Q = 0; 

(b) f{x) < a almost everywhere on F; 

(c) Q is the union of almost-disjoint cubes {Q n : n G N} such that 

a < -—- I f(x) dx < 2 d a. 
\Qn\ Jo n 
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This is of paramount importance in the theory of singular integrals, which 
we briefly touch upon in £2.4 A proof of the lemma can be found in |Ste70] . 
1.3.3, although the lemma is usually integrated into the C alder on- Zygmund 
decomposition in many expositions: see Theorem |2.60| for details. 



1.5.4. Littlewood's three principles serve as a useful guide for studying the 
properties of the Lebesgue measure. The first principle states that every set 
is nearly a finite sum of intervals. To make this precise, we recall that the 
symmetric difference of two sets E and F is defined by 

EAF = (E\F)U(F\E). 

Proposition 1.51. If E C ]R d is of finite measure, then, for each e > 0, 
there exists a finite sequence (Q n )n=i of cubes such that 



N 



EA |J Q r 



n=l 



< e. 



The second principle, which states that every function is nearly continu- 
ous, can be stated as follows: 

Theorem 1.52 (Lusin). If E is a finite-measure subset ofW 1 and f : E — y C 
a measurable function, then, for each e > 0, there exists a closed subset F of 
E such that \E \ F\ < e and f\p is continuous. 

The third and the final principle states that every convergent sequence of 
functions is nearly uniformly convergent and can be formulated precisely as 
follows: 

Theorem 1.53 (E gorov). If E is a finite-measure subset ofM, d and {f n )'^Li 
a sequence of measurable functions on E that converge pointwise almost ev- 
erywhere to f : E — )• C, then, for each e > 0, there exists a closed subset F 
of E such that \E \ F\ < e and f n —*f uniformly on F. 

Lusin's theorem can be established on more general domains, which can 



then be used to generalize Theorem 1.14 We shall take up on this matter in 
the next subsection. 



1.5.5. Theorem 1.14 can be generalized to locally compact Hausdorff do- 
mains with complete Borel regular measures. This is a direct consequence of 
the generalized Lusin's theorem: 
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Theorem 1.54 (Lusin). Let // be a complete Borel regular measure on a lo- 
cally compact Hausdorff space X. If f is a complex-valued measurable func- 
tion on X, A a finite-measure subset of X, and supp /Ci, then each e > 
admits a function g G C C (X) such that 

fi({x : f(x) 7^ g(x)}) < e and sup \g(x)\ < sup |/(x)|. 

See §2.24 in |Rud86] for a proof. Crucial in proving the above theorem is 
Urysohn's lemma for locally compact Hausdorff spaces, which can be stated 
as follows: 

Theorem 1.55 (Urysohn's lemma). Let X be a locally compact Hausdorff 
space. For each open subset O of X and a compact subset K of O, we can 
find a function f G C C (X) such that < f(x) < 1 on X, supp / C O, and 
f(x) = 1 on K . 

A proof of the above theorem can be found in |Rud86j . §2.12. An even 
more general version of Lusin's theorem for Radon measures appears in 
|Fol99j as Theorem 7.10. Urysohn's lemma continues to hold for normal 
spaces: see §33 in |Mun00j or Theorem 4.15 in |Fol99j for a discussion. 



1.5.6. The name approximations to the identity (as in Theorem 1.23 and 



Corollary 1.24) can be motivated by considering a more abstract framework. 
A Banach algebra is a (complex) Banach space srf with associative bilinear 
multiplication operation satisfying the inequality 



\xy\\ < \\x\\ \\y\ 



for all x,y G stf ' . Young's inequality shows that the space L 1 equipped with 
the convolution operation is a Banach algebra. 

Given a Banach algebra £/, a left approximate identity, or a left approxi- 
mation to the identity, is a sequence 



{e n )^ =1 of elements in srf such that 



lim 

n— ¥co 



77 







for all x G s# . While e n is not quite a multiplicative identity of the Banach 
algebra stf ' , it serves as one "at infinity" — hence the name "approximation to 
the identity" . Right approximate identities can be defined analogously. 
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1.5.7. Another useful corollary of Theorem 1.23 is that, for 1 < p < oo, an 
L p function u on an open subset O of M. d is zero almost everywhere on fl in 
case 

f fu = 



I 



for all / e ^ c °°. See J 1.5.2 



A consequence of the above result is that proving the equal almost- 
everywhere state of two functions amounts to integration the difference of 
two functions against a small class of "test functions". This idea will be a 
recurring theme in distribution theory, which will be developed in §2.3 



1.5.8. The optimal constant in the <i-dimensional Hausdorff- Young inequal- 
ity is Cp, where 

1/p x 1 / 2 



Cp 



Xp') 1/p '. 

The equality is achieved if and only if / is of the form 

f(x) = Aexp (—x ■ Ax + b • x) , 

where A is a real symmetric positive-definite matrix and b a vector in C n . 
The optimal constant was given for all 1 < p < 2 by William Beckner in 
|Bec75j . The necessary and sufficient condition for the equality, published in 
|Lie90j . is due to Elliott Lieb. 



The optimal constant for the d- dimensional generalized Young's inequal- 



ity, also established by Beckner in |Bec75j . is (C p C q C r >) d , where the constants 
are defined as above. In [Lie90] . Lieb gives a necessary and sufficient con- 
dition for an even more general version of Young's inequality. See |LL01] , 
Theorem 4.2 for a textbook exposition of the fully generalized Young's in- 
equality. 



Chapter 2 



The Modern Theory of 
Interpolation 

The idea of the proof of the Riesz-Thorin interpolation theorem can be gen- 
eralized to a class of operators on spaces other than the Lebesgue spaces. 
Alberto P. Calderon's insight was to consider interpolation as an operation 
on spaces, rather than on operators. First published in [Cal64j, Calderon's 
complex method of interpolation absorbs the complex-analytic proof of the 
Riesz-Thorin interpolation theorem and provides an abstract framework in 
which many new interpolation theorems can be generated. 

On the more concrete side, it became increasingly evident that many use- 
ful operators simply were not bounded on L 1 or L°°. Hardy spaces and the 
space of bounded mean oscillation were introduced as well-behaved substi- 
tutes, and it was proven by Charles Fefferman and Elias M. Stein in |FS72] 
that operators on these spaces can be interpolated as if they are on Lebesgue 
spaces. Precisely, the Fefferman-Stein interpolation theorem is a general- 
ization of the Stein interpolation theorem on analytic families of operators, 
where the operator on L l can be bounded only on a subspace H 1 of L 1 , and 
the operator on L°° can satisfy a weaker estimate — » BMO) than the 
usual estimate — y L°°). 

The necessary machinery for these interpolation theorems is developed in 
the first and the third sections. In the fourth section, we study the Hilbert 
transform, which sets the stage for the Fefferman-Stein theory sketched in the 
fifth section. In the sixth and the last section, the Fefferman-Stein theory is 
applied to the study of differential equations and Fourier integral operators, 
and the complex interpolation method is applied to Sobolev spaces. 
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2.1 Elements of Functional Analysis 

We begin the chapter by establishing a few key results from functional anal- 
ysis, as explicated in many references such as [B rellj . |Lax02] . |Rud91j . 
|Yos80| . and JDS58J. We have already encountered functional-analytic meth- 
ods in Chapter 1, in which we often investigated the properties of spaces of 
functions rather than those of individual functions. We shall raise the level of 
abstraction even higher in this section, and study various kinds of topological 
vector spaces: 

Definition 2.1. A real or complex vector space V is a topological vector space 
if V is equipped with a topology such that the addition map (x, y) i— > x + y 
and the scalar multiplication map (a, v ) h-> av are continuous. 



2.1.1 Continuous Linear Functionals on Frechet Spaces 

LP spaces and, in general, Banach spaces are canonical examples of topologi- 
cal vector spaces. Some function spaces, however, do not admit one canonical 
norm. They might come with multiple natural norms; there might not be a 
convenient quotient construction to turn the norm-like map into a genuine 
norm. We are thus led to the following generalization: 

Definition 2.2. A seminorm on a topological vector space V is a function 
p : V — > [0, oo) such that p(av ) = \a\v and p{v + w) < p{v) + p{w) for all 
scalar a and vectors v and w. 



We shall see in [2.3 that the canonical topology on o$^(lR d ) is given by a 
countable collection of seminorms. This turns J^(R ) into a Frechet space, 
which we now define. 

Definition 2.3. Let V be a real or complex topological vector space. V is 
a Frechet space if V is equipped with a countable collection {p n : n 6 N} of 
seminorms such that 



d(v, w) 



n=l v 



p n {v - W) 



+ p n (v-Vj) 

is a metric generating the topology of V. 

We note that the seminorms p n are continuous in the topology defined as 
above. It is clear that every Banach space is a Frechet space. While most 
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function spaces we study in the present thesis are Banach spaces, we shall 
see that some are more naturally described in the language of Frechet spaces. 

For now, we study a different problem: namely, the existence of nontriv- 
ial continuous linear functionals. We know from linear algebra that finite- 
dimensional vector spaces have as many linear functionals as the vectors 
therein, and that each vector space is isomorphic to its dual space. It is not 
at all clear, however, if there are nonzero linear functionals on an infinite- 
dimensional space, let alone continuous ones. 

There are, of course, some infinite-dimensional spaces with nontrivial con- 
tinuous linear functionals. For example, we have seen that the LP spaces have 



infinitely many bounded linear functionals. Indeed, the Riesz representation 



theorem asserts that the dual of L p spaces are highly nontrivial. 



Definition 2.4. The dual space of a topological vector space V is the vector 
space V* of continuous linear functionals on V. 

Our immediate goal, then, is to show that the dual of many topologi- 
cal vector spaces are nontrivial. This requires a preliminary result, widely 
regarded as one of the cornerstones in functional analysis. 

Theorem 2.5 (Hahn-Banach). Let V be a real vector space and p a semi- 
norm on V. Lf M is a linear subspace ofV and I a real linear functional on 
M such that l{v) < p(v) for all v G M, then there exists a linear functional 
L on V such that L\m = I and L(v) < p(v) for all v G V. 

Proof. If M = V, then there is nothing to prove, and so we may suppose the 
existence of a vector z G V \ M. We first show that we can extend I by one 
extra dimension. If L is an extension of I on M © such that L(v) < p(v) 
for all fv G M © M.Z, then, for any y , y\ G M, we have 

L(y ) + L{y x ) = L(y + y x ) 

< p(yo + yi) 

= p(y - z + z + yi) 

< p(y -z)+p(z + y 1 ). 

This implies that 

L(Vo) - p(yo ~z)< p(z + yi) - L(yi), 

and so 

sup L(y) - p(y - z) < inf p(z + y) - L(y). (2.1) 
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Recall now that every v G M © BLz can be written as the sum 

v = y + Xz, 

where y G M and A el. We fix a real number a such that 
sup L{y) - p(y- z) <a< inf p(z + y) - L(y) 

and define 

L(v) = L(y) + Xa 

for each v G M, so that L is a linear extension of / on M. Furthermore, if 
A > 0, then 

L(v)=L(y + Xz) = L(y) + Xa 

= X[L{X- l y/) + a] 

< X {L{X- l y) + [p(z + X-'y) - L(X~ 1 y)] ) 

= Xp{z + X- l y) 

= p(y + A^) = p(v) 

by ( pi] ). If A < 0, then 

L(v) = L(y + Xz) = L(y)-{-X)a 

= (-A) [L{—X~ l y) — a] 

< (_ A ) (K-X-'y) - [L{-X- l y) - p^-X^y - z)]) 

= {-X)p{-X- l y-z) 

= p{y + ^) = P (v). 

If A = 0, then L(y + Xz) = l(y), and there is nothing to prove. We have thus 
demonstrated that we can always extend / by one extra dimension. 

Let us now return to the proof of the theorem in its full generality. We 
define a partial order < on the collection of ordered pairs (/', M') of linear 
extensions /' of / on M' satisfying the bound l'(v) < p(v) for all v G M' by 
setting (/', M') < (I", M") if and only if M' C M" . Given a chain 

(li,Mx) < (Z 2 ,M 2 ) < (h,M 3 ) <■ (2.2) 

of such ordered pairs, we define the pair (l^, M ra ) by setting 



M OQ =[jM n and ^ = l n (v) 
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where n is chosen such that v G M n . The definition of is unambiguous, 
because the linear functionals agree on common domains. It is easy to see 



that (Zoo, Mqo) is an upper bound of the chain (2.2). 

We now invoke Zorn's lemma to construct a maximal element (L, Mo) of 
the collection given above. If M ^ V, then we can extend L by one extra 
dimension, whence (L, M ) is not maximal. It follows that L is the desired 
linear functional, and the proof is now complete. □ 

Since most functions we study in this thesis are complex-valued, the cor- 
responding function spaces are mostly complex vector spaces as well. We 



therefore require the following generalization of the Hahn-Banach theorem 



commonly referred to as the complex Hahn-Banach theorem. 

Theorem 2.6 (Bohnenblust-Sobczyk). Let V be a complex vector space and 
p a seminorm on V . If M is a linear subspace of V and I a complex linear 
functional on M such that \l(v)\ < p(v) for all v G M, then there exists a 
linear functional L on V such that L\m = Z and \L(v)\ < p{v) for all v G V. 

Proof. We first note that Z can be written as 

Z = Zi + il 2 

where l\ and A 2 are real linear functionals on V, considered as a real vector 
space. For each v G M, we see that 

h(iv) +il 2 (iv) = l(iv) = il(v) = i [h(v) + ih{v)\ = ih(v) - h(v). 

This implies that 

k(v) = -h{iv) 
for all v G M. Since we have the bound 

h(v) < \h(v)\ < \l(v)\ < p(v) 



for all v G M, we can invoke the |Hahn-Banach theorem| to construct an 
extension L\ of l\ on V such that L\ is still dominated by p. 
We define a complex linear functional L on V by setting 

L(v) = Li(v) — iL 1 (iv) 

for each v G V, which is easily seen to be an extension of Z. To show that 
1-^(^)1 ^ p(v) for each v G V, we fix a v G V and find r > and 9 G [0, 2tt) 
such that L(v) = re l6 . We then have 

\L(v)\ = r = re ie e- ie = e- ie L(v) = L(e- i6 v). 
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Noting that \L(v)\ > 0, we conclude that 

\L(v)\ = L(e- w v) = L l (e- ie v). 

Since 

-Lx{e- iB v) = Lti-e^v) < p{-e~ ie v) = p{e~ iB v), 
we see that \Li(e~ ie v)\ < p(e v). It thus follows that 

\L(v)\ = L^v) = \L x {e- ie v)\ < p(e~ w v) = \e*\p(v) = p(v), 

as was to be shown. □ 

We are interested in bounded linear functionals, so we now put a topology 
on our vector space. The corresponding extension theorem is the following: 

Theorem 2.7. Let V be a real or complex topological vector space. If p 
is a continuous seminorm on V and vq a vector in V, then there exists a 
continuous linear functional L on V such that L(vq) = p(vq) and \L(v)\ < 
p(v) for all v G V . 

Proof. We let F denote either 1 or C and define a linear functional / on the 
one-dimensional subspace Fv of V by setting 

l(Xv ) = \p(v ). 



Since |Z(At> )| = l-V^o)! = p(^ v o)i we can a PPby either the real Hahn-Banach 



theorem| or the complex Hahn-Banach theorem| to construct a linear func- 



tional L on V such that L\^ Vo = I and \L(v)\ < p{v) for all v E V. L is a 
linear extension of I, and so L(vo) = l(vo) = p(vo). Furthermore, the bound 
1-^(^)1 < p(v) implies that L is continuous at v = 0, whence by linearity it is 
continuous on V. □ 

For a large class of topological vector spaces known as locally convex 
spaces, a stronger extension theorem can be established. Since we will not 
need such generality, we state and prove a special case of the theorem for 
Frechet spaces. See § §2.7.1 for a discussion of locally convex spaces. 



Corollary 2.8. Let V be a Frechet space. For every nonzero vector Vq in 
V , there exists a continuous seminorm p on V such that p(vq) ^ 0. Conse- 
quently, there is a continuous linear functional I on V such that 1(vq) ^ 
and \l(v)\ < p(v) for all v G V. 
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Proof. Let {p n : n G N} be a collection of seminorms generating the topology 
on V, all of which must be continuous. Given a fixed nonzero vector vq in 
V, we observe that there exists an iV G N such that pn(vq) ^ 0. If not, 
then the distance between vq and the zero vector in the metric generated by 



{p n : n G N} is zero, which is evidently absurd. Theorem 2.7 can now be 
applied to pjy, whereby obtaining a continuous linear functional continuous 
linear functional I on V such that l(v ) = p(v ) ^ and \l(v)\ < p(v) for all 

v eV. □ 

If V is a Banach space, then the extension theorem can be strengthened 
as follows: 

Corollary 2.9. Let V be a Banach space. For every nonzero vector vq in 
V, there exists a continuous linear functional I such that 1(vq) = \\vo\\ and 

Proof. Fix a nonzero vector v G V. Since the norm || • || is a continuous semi- 
norm on V with ||f || 7^ 0, we apply Corollary 2.8 to construct a continuous 
linear functional / such that l(v ) = \\v \\ and \l(v)\ < \\v\\ for all v G V. The 
second conclusion implies that ||/|| < 1, whence the first conclusion implies 
that ll/ll = 1. □ 



2.1.2 Complex Analysis of Banach- valued Functions 

Let us now consider an application of the extension theorems established 
in the previous subsection. As alluded to in the beginning of the chapter, 



we shall extend the Riesz-Thorin interpolation theorem (Theorem 1.43) to a 



more general framework in ^2.2 To do so, we shall need to consider Banach- 
valued functions on C. It is therefore convenient to be able to apply complex- 
analytic methods to functions on C mapping into a complex Banach space. 

Definition 2.10. Let V be a complex Banach space and O a connected open 
subset of C. A function / : O — > V is holomorphic if, for every bounded linear 
functional I on V, the composite map If is a holomorphic function on O as 
a complex function of one variable. 

Since there are plenty of bounded linear functionals on V, the definition 
is non-trivial. Indeed, it is sufficiently restrictive that the theorems of com- 
plex analysis, such as Louiville's theorem, continue to hold. Note that every 
bounded linear operator T : V — >■ W between Banach spaces preserves the 
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holomorphicity of V^-valued functions, for the composition of T and an arbi- 
trary bounded linear functional on W is a bounded linear functional on V. 
Indeed, if / is a ^-valued holomorphic map, then, for each linear functional 
I on W, the composite map ITf is a complex- valued holomorphic map. It 
then follows that Tf is a W- valued holomorphic map. This technique is used 



in the proof of Theorem 2.32 



If V is a Banach space of linear operators^ then we can talk about power 
series in V, where the infinite sum is defined by the limit of the Cauchy 
sequence of partial sums. In this V^-valued function on O with a 

power-series expansion in V is holomorphic. We shall see an application of 
this technique in the next subsection. 

2.1.3 Spectra of Operators on Banach Spaces 

Among the central objects of study in finite-dimensional linear algebra are 
eigenvalues and eigenvectors: an eigenvalue of an n-by-n matrix A with 
complex entries is a complex number A such that 

Av = Xv 

for some nonzero n-vector v, which is referred to as an eigenvector of A with 
respect to the eigenvalue A. Writing / to denote the n-by-n identity matrix, 
we see that the eigenvectors with respect to an eigenvalue A are precisely the 
elements of the nullspace of A — XI. In other words, if a complex number A 
renders the matrix A — XI invertible, then the nullspace of A — XI is trivial, 
whence there are no "eigenvectors" corresponding to A. This implies that A 
is not an eigenvalue of A. 

Let us now consider a bounded linear operator T on a complex Banach 
space V. We define the resolvent set p(T) of T to be the collection of complex 
numbers A such that the operator 

I — XT 

is invertible. The spectrum cr(T) of T is defined to be the set C s p(T). 
We observe that these definitions are straightforward generalizations of the 
above observation. 

Recall that finding the eigenvalues of an n-by-n matrix A amounts to 
solving the polynomial equation 

det(A -XI) = 

1 or, more generally, a Banach algebra: see §j jf .5.6 for the definition. 
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for A. If A has complex entries, then the fundamental theorem of algebra 
guarantees that the roots always exist, whence every matrix with complex en- 
tires has eigenvalues. As it turns out, our infinite-dimensional generalization 
retains this property. 

Theorem 2.11. Let V be a complex Banach space and T : V — >■ V a bounded 
linear operator. The spectrum a(T) ofT is nonempty. 

To prove this result, we first observe that invertible operators form an 
"open set" . 

Lemma 2.12. Let V be a complex Banach space and T : V — >■ V a bounded 
linear operator. If T is invertible, and if another bounded linear operator 
T' : V — > V satisfies the norm estimate 

\\T \\v^v < 



llr-lv-v' 
then T — T 1 is invertible. 

Proof of lemma. We assume for now that T is the identity operator /. In 
this case, the lemma asserts that all bounded linear operators I — T' with 
the norm estimate ||T'|| v ^v < 1 is invertible. In this case, the sequence of 
partial sums 

N 

I,I + T',I + T'+ (T') 2 , • • • ^(T') n , • • • 

n=0 

is Cauchy in the space J£?(V) of bounded linear endomorphisms on V, which 
is Banach. Therefore, the operator 

oo TV 



Af-5>oo ■ 
n=0 n=Q 



n=0 

is well-defined. We observe that 

oo 

(/ - T') V (T') n = Urn I- (T') n+1 = / 

and that 



TV-s-oo 
n=0 



Y{T') n (/ - T') = Urn I- (T') n+i 

— ' / TV^oo 



K n=0 
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whence / — T' is invertible and 

oo 

(I-T'Y 1 = Y,( T T- (2-3) 



n=0 



We now consider an arbitrary bounded, invertible linear operator T : 
V — > V. Fix a bounded linear operator T' : V — > V such that 

ll^'llv^v < urn in ■ 



W 1 ||v-»v 

We factor 

T — T' = T(I — T- l T') 

and observe that 



ilT -1 li 

\rp— lrpl\\ ^ \\rp— 111 I|T _ '/|| ^ ll J l|V— >V -i 

| J J llv^y S Ik llv-s-vll-' \\v-+v < ,, rr _ 1 n — !■ 



Therefore, the argument carried out in the above paragraph shows that 
/ — T~ X T' is invertible. Since T is also invertible, it follows that T — T' 
is invertible, as was to be shown. □ 

We also establish the following computational result: 

Lemma 2.13. // T and T' are bounded, invertible linear operators on a 
Banach space V, then 

T -i _ ( T ')-i = 7-1(2" _ T){T')-\ 
Proof of lemma. Observe that 

T -i _ ( T ')-i = T-^T'XT')- 1 - T^TiT')- 1 = T'^T' - T)(T')-\ 
as was claimed. □ 
We now return to the proof of the main theorem. 



Proof of theorem 2.11 Let T : V — > V be a bounded linear operator. If 
A G p(T), then Lemma 2.12 implies that 

(A - e)I — T= (XI -T)-eI 
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is invertible for sufficiently small e G C. It follows that p(T) is open, and so 
cr(T) is closed. 

We claim that A i— > (XI — T) _1 is a ^-valued holomorphic map on p(T). 
For each fixed p G p{T), Lemma 2.13 implies that 

{XI -T)~ l - jpI-T)- 1 {XI - T)-\pI - XI){pI - T)- 1 
A — p X — p 

= -{XI -T)- 1 {pI -T)-\ 

Therefore, we have 

lim {XI-T)- 1 -{pI-T)- 1 = nm _ (A/ _ T) -i (/i/ _ T) -i = _ (A/ _ T) -2 ; 

At->A A — p fi->X 

and so 

lim "' A/ - r )' 1 )-"'" / - r )") = ,((A/ - TV) 

iu->A X — p 

for each bounded linear functional / on V. It follows that A >->■ (A/ — T) _1 is 
holomorphic. 

Let us now suppose for a contradiction that p(T) = C. This, in particular, 
implies that A — > 1{{XI — T)~ x ) is an entire function for every bounded linear 
functional I on V. We apply the power-series expansion Since continuous 
maps send compact sets to compact sets, this map is bounded on every 
compact subset of p(T). We apply the power-series expansion (2.3 ), employed 



in the proof of Lemma 2.12, to {XI — T) 1 



{XI - T)- 1 = X-\I - X^T)- 1 = X- 1 X ~ nT ' 

n=0 

It follows that 

\\{XI-TY X \\ V ^ V < 



A 



and so || (A/ — T) 1 1| — > as |A| — > oo. 
Furthermore, we have 

(/((AJ-T)- 1 )! < ph^cIKaj-t)- 1 !!^ < 



All-IITI 



for each bounded lienar functional / on V, whence A 1{{XI—T) x ) vanishes 
at infinity. Louiville's theorem can now be applied to conclude that this map 
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is the zero map, regardless of our choice of /. Given a fixed A G p(T), however, 
(XI — T) _1 is invertible and thus not the zero operator. We invoke Theorem 



2.9 to construct a bounded linear functional I on V such that ||Z|| = 1 and 

||Z((AI - T))- 1 !!^ = \\(XI - T)- l \\ v ^y > 0. 

This is evidently absurd, since the map A t— > l((XI — T)^ 1 ) was shown to be 
the zero map. It now follows that p(T) ^ C, whereby we conclude that cr(T) 
is nonempty. □ 

We remark that the above theorem does not guarantee the existence of 
eigenvalues proper for all bounded operators. Indeed, the right-shift operator 
R : Z 2 (Z) — > Z 2 (Z) defined on the space l 2 (Z) of square-summable sequences 
by 

R{(0"n)neZ) = ( a n+l)neZ 

admits no eigenvalues or eigenvectors, for the identity 

Xa n = a n+ i 

does not hold for all sequences (a n ) n6 z, regardless of the value of A. 

The proof of the theorem goes through verbatim if we substitute the 



bounded operators with elements of a Banach algebra. See 1.5.6 for the 
definition of a Banach algebra. 

It is sometimes useful to classify operators by the content of their spectra. 

Definition 2.14. A bounded operator T : V — > V on a complex Banach 
space V is positive if the spectrum consists of positive real numbers, and 
negative if the spectrum consists of negative real numbers. 



We shall prove a theorem about positive operators in ^2.2.3 An example 



of a positive operator can be found in ^2.6.3 



2.1.4 Compactness of the Unit Ball 

We now consider a property of finite-dimensional vector spaces that do not 
generalize to infinite-dimensional spaces. Recall the Heine-Borel theorem, 
which guarantees that the closed and bounded sets in R d and C d are compact. 
As it turns out, the norm topology on an infinite-dimensional Banach space 
has "too many open sets" to preserve this property. 
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Theorem 2.15. The closed unit ball in a Banach space V is compact if and 
only if V is finite- dimensional. 

We will not have an occasion to use this theorem, so we omit the proof: 
see, for example, Section 5.2, Theorem 6 in |Lax02j . The proof can be 
sketched easily if V is a separable Hilbert space. Recall the standard re- 
sult in Hilbert-space theory that there is an orthonormal basis {e n : n G N} 
of V: intuitively, there are countably many axes in V that are perpendicular 
to one another. If we construct an open cover of V such that each "axis" of 
the unit ball in V is covered by an open set that does not overlap much with 
the rest of the cover, then none of the countably many sets covering these 
axes can be removed without losing the open-cover status. 

Given the usefulness of compact sets, we seek a way to reduce the number 
of open sets, thereby increasing the number of compact sets. To this end, we 
recall that the dual of a Banach space V is a Banach space, and that the map 
v (->• L v from V to its double dual V** defined by L v (l) = Iv is an isometric 
embedding. 

Definition 2.16. Let V be a Banach space. The weak-* topology on V* with 
respect to V is the topology generated by the sets L~ 1 (0), where L v is the 
linear functional in the above proposition and O an arbitrary open set in C. 



With this new topology, Theorem |2.15 can now be reversed. 



Theorem 2.17 (Banach- Alaoglu). If V is a Banach space, then the unit ball 
in V* is compact in the weak-* topology. 

Proof. Let F denote either R or C. We consider the product space ¥ v con- 
sisting of all functions from V into F, equipped with the standard product 
topology. An arbitrary element of ¥ v shall be denoted by u = (cj v ) ve v, where 
uj v is the value of u evaluated at v. 

We now consider the topological embedding $ : V* — > ¥ v defined by 
$(/) = (co v ) ve v, where u v = l(v). It suffices to check that $(-B) is compact, 
where B is the closed unit ball in V* . We observe that $(-B) is the collection 
of u — (u v ) v( zv in F y such that \u v \ < \\v\\v, oj v+w = u v + u w , and u\ v = Xu v 
for all A G F and v , w G V. By setting 

K x = {u G ¥ v : \u v \ < \\v\\v for all v G V} 
K 2 = {u G ¥ v : tu v+w = uj v + u w and U\ v = \tu v 
for all A G F and v, w G V}, 
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we see that $(5) = K x n K 2 . 

Observe that K\ is a product of the closed intervals [— as v 

runs through V, and so Tychonoff's theorem implies that Ki is compact. 
Furthermore, the sets 

E VjW = {uo G ¥ v : — Wt, — oo w = 0} 
-^a,d = {w G F y : w At) - Aw,, = 0} 

are closed for each fixed A G F, whence 









( n e ^ 






\v,wev J 




\ AeF / 



is closed. Therefore, $(B) = K 1 C\ K 2 is compact, and so is B. □ 

2.1.5 Bounded Linear Maps between Banach Spaces 

An important property of complete metric spaces is the Baire category theo- 
rem, which states that no complete metric space can be written as a countable 
union of nowhere dense sets, viz., sets whose interiors of their closures are 
empty. In this subsection, we apply the category theorem to the study of 
linear maps between banach spaces and derive two powerful consequences. 

Recall that a function / : X — > Y between topological spaces X and Y 
is open if f(U) is open in Y for each open set U in Y and The first result 
concerns open linear maps between Banach spaces. 

Theorem 2.18 (Banach-Schauder, open mapping theorem). Let V and W 

be Banach spaces and T : V — >■ W a bounded linear transformation. If T is 
surjective, then T is open. 

Proof. We write B^(x) and B^(y) to denote the open balls of radius r 
centered at x G V and y G W, respectively. If we can show that T(i?J / (0)) 
contains an open ball centered at the origin, then the linearity of T establishes 
the desired result. To this end, we shall first show that T(_B| / (0)) contains 
an open ball centered at the origin. Since T is surjective, 

oo 

w=\Jt(bX(o)), 

n=l 
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whence the Baire category theorem implies the existence of an integer n such 
that T(B^ Q (0)) is not nowhere dense. Therefore, T(B^ Q (0)) has a nonempty 
interior, and so the linearity of T yields a point wq £ W and a real number 
e > such that 

BY {wo) C T(Bfi, 

We now fix a point t> i £ -8^(0) such that w\ = T{v\) satisfies the distance 
estimate ||wi — wo|| < s/2. If w £ B^ 2 (Q), then 

|| («; - wi) - w \\ < \\w\\ + ||i/;i - w || < | + | = £, 



and so w - Wi £ B^(wo) Q T(BY(0)). Since w = T(^i) + w — w±, the 
linearity of T implies that w £ T^^O)). Once again, the linearity of T 
establishes the inclusion 



^ 4 (0)CT(Bf(0)), 

and our claimed is proved. We remark that we can assume without loss of 
generality that 

b?{o) c fjBjm, 

by rescaling T in the above inclusion if necessary. This, in particular, implies 
that 



B^(0)CT(5j; 2 „(0)). (2.4) 

for each n £ N. 

We now show that 

BY,M C T«(0)), (2.5) 

which then establishes the theorem. Fix w £ By 2 (0), which by (2.4) is 

in T(5jy 2 (0)). We can then find V\ £ £>^ 2 (0) with the distance estimate 
||wi — T(t>!)|| < 2~ 2 , or, equivalently, the inclusion 

Wl -T(v 1 )eB$#{0). 



Applying (2.4) once again, we see that w\ — T(vi) £ T(By 22 (0)), whence we 
can find Vi £ B^ 22 (0) such that 



w 1 -T(v 1 )-T(v 2 )eB^{0). 
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Continuing the process, we obtain a sequence (v n )^ =1 of vectors in V such 

that \\v n \\ < 2~ n . The sequence (t>H \-vn)n=i of partial sums is therefore 

Cauchy, and the completeness of V furnishes a limit v = Yl v n with the norm 
estimate 



Now, T is continuous, and 



\v\ < 



N 



1. 



(2.6) 



n=l 



W 



-En 



n=l 



< 2 



-n-l 



for each N e N, whence it follows that \\w — T(v)\\ = 0, or w = T(v) 



Combined with (2.6), this establishes (2.5), and the proof is complete. □ 



The second result concerns closed linear maps between Banach spaces. 
An operator T : V — >■ W between two normed linear spaces V and W is 
closed if v n — > v in V and Tv n — > w implies Tv = w. This is not equivalent 
to the notion of a closed map in point-set topology, which refers to a function 
/ : X — > Y between topological spaces X and Y such that f(U) is closed in 
Y whenever U is closed in X. 

Theorem 2.19 (Closed graph theorem). Let V and W be Banach spaces 
and T : V — )■ W a linear transformation. If T is closed, then T is bounded. 

Why the name closed graph! Recall that the graph of a linear map 
T : X — )■ Y between two Banach spaces V and W is defined to be the set 

Gj- = {(v,T(v)) G V x W : v G V}. 



v n ,T(v n )) -> (v,Tv), 



If T is closed, then (v n ,T(v n )) — >■ (v,w) implies that 
which is in Gt- Conversely, if Gt is closed, then v n — > v and Tv n — > w 
implies that the limit \im(v n ,Tv n ) = (v,w) must be in Gt, whence w = Tv. 
It follows that the closedness of T is equivalent to the closedness of its graph 
Gt- 

Proof. We first note that the normed linear space V x W with the norm 
\\(v, w)\\vxw — IMIv+ ll^llw is a Banach space^] Indeed, the norm properties 



2 The second half of Proposition 2.25 establishes a strengthening of this result for the 
internal sum V + W. To see why this is a generalization, we recall that the product V x W 



is isomorphic to the direct sum V © W, which equals the internal sum V - 
if V and W have a trivial intersection as subspaces V © W. 



W if and only 
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are established. If {(v n , w n )}^ =l is a Cauchy sequence in V x W, then (v n )^ =1 
and (w n )^ =l are Cauchy in V and W, respectively, and so v „ — > t> and u>„ — >• tu 
for some u G V and weW. It now suffices to note that 

||(t> n ,u> n ) - (v,w)\\ VxW = \\(v n - v,w n -w)\\vxw = \\v n -v\\ v + \\w n -w\\w, 

which converges to as n — > oo. It follows that V x W is a Banach space, 
whence the closed subspace Gt of V" x W is also a Banach space. 

We now consider the projection maps Py : Gt —> V and Pw '■ Gt — > W 
defined by 

Py{v,Tv) = v and Pw(v,Tv) = Tv. 



Since Py is a bijective, bounded linear map, the |open mapping theorem 



implies that Py is an open map. This, in particular, shows that the inverse 
Py 1 is a bounded linear map. Likewise, P\y is a bounded linear map, and so 
the composition 

T = P W Py 1 

is bounded as well, thereby establishing the theorem. □ 



We remark that the open mapping theorem can be proven using the 
closed graph theorem, thereby establishing the equivalence between the two 
results. We shall see an application of the closed graph theorem in the next 
subsection. See § §2.7.6 for another important corollary of the Baire category 



theorem and its application to the Fourier inversion problem. 



2.1.6 Spectral Theory of Self- Adjoint Operators 

We now restrict our attention to Hilbert spaces, on which we can establish 
substantial generalizations of many results from finite-dimensional linear al- 
gebra. In particular, we shall focus on the problem of diagonalization in this 
subsection. 

Recall that an n-by-n matrix A with complex entries is self- adjoint if 
A equals the conjugate transpose A* of A, and unitary if A' 1 equals A*. 
A standard result in linear algebra is that every self-adjoint matrix A is 
unitarily equivalent to a diagonal matrix, viz., there exists a diagonal matrix 
D and a unitary matrix U such that D = U~ l AU. Since the columns of a 
unitary matrix form an orthonormal basis, it follows that every self-adjoint 
matrix can be diagonalized with respect to an orthonormal basis. This is the 
spectral theorem. 
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The spectral theorem can be generalized considerably. To this end, we 
first define the infinite-dimensional generalization of self-adjoint matrices. 

Definition 2.20. Let H be a complex Hilbert space. A linear operator 
T : H — > H is self-adjoint if 

{Tv,w) H = (v,Tw) H 

for all v,w G H. 

We show that self-adjoint operators on Hilbert spaces must be bounded . 

Proposition 2.21 (Hellinger-Toeplitz). If T is a self-adjoint operator on a 
Hilbert space H , then T is bounded. 

Proof. Fix vectors v , w, v±, v 2 , ■ ■ ■ , v n , . . . in H such that v n — > v and Tv n — > 
to. By the self-adjointness of T, we have 

(Tv n ,x) H = (v n ,Tx) H 
for each n G N and every x G H, whence taking the limit yields 

(w,x) H = (v,Tx) H . 
Applying the self-adjointness of T once again, we have the identity 

(w,x) H = (Tv,x) H 

for all x G H. Therefore, 

(w — Tv, x)h = 

for all x G H, and, in particular, 

(w — Tv, w — Tv)h = \\w — Tv\\ 2 H = 0. 

It follows that w = Tv, and so T is closed. We now invoke the closed graph 
theorem (Theorem 2.19) to conclude that T is bounded. □ 

Note, however, that there is nothing "topological" about the definition 
of self-adjointness, and so we seek to generalize the definition to unbounded 
operators. In light of the above proposition, we are forced to define un- 
bounded self-adjoint operators only on proper subspaces of the Hilbert space 
in question. 
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Definition 2.22. Let H% and H 2 be complex Hilbert spaces, D a dense 
subspace of Hi, and T : D — > H 2 a linear operator. We define D* to be the 
collection of all w G H 2 such that, for each v G Hi, there exists a vector 
w* G i?i satisfying the identity 

(Tv,w)h 2 = (v,w*) Hl - 
The adjoint of T is the operator T* : D* — )■ i?i defined to be 

T*w = 

at each w 6 /)*, so that 

(Tt>, w) = (v, T*w) 

for each v E D and every w £ D*. T is self- adjoint if D = D* and T = T*. 

A remark is in order. The density of .D guarantees that there is only one 
w* for each w G D, whence T*w is unambiguously defined. Of course, D* is 
a linear subspace of H 2 , and T* a linear operator. Henceforth, we shall write 
T>(T) to denote D, and T>(T*) to denote £)*. Furthermore, we shall speak of 
an unbounded operator T on Hi, although it is defined only on T>(T). 

Let us now return to the task at hand. A unitary operator on a Hilbert 
space Hi into another Hilbert space H 2 is a bounded linear operator U : 
Hi — > H 2 such that T _1 = T*. The spectral theorem for linear operators on 
separable Hilbert spaces can now be stated as follows: 

Theorem 2.23 (Spectral theorem). Let T be an unbounded self-adjoint oper- 
ator on a separable Hilbert space H. There exists a measure space (X, 9Jt, fi), 
a unitary operator U : L 2 (X,fi) — > H, and a real-valued [i-measurable func- 
tion a on X such that 

U~ 1 TUu(x) = a(x)u(x) 
for all Uu G T>(T). Furthermore, Uu G T>(T) if and only if au G L 2 (X,fi). 

This formulation of the spectral theorem is that of Theorem 1.7 in Chap- 
ter 8 of |TaylOa| . The proof is rather elaborate and requires heavy machinery, 
so we omit it. See Chapter 8 of |TaylOa| , Chapter 32 of [Lax02] . Chapter 13 
of |Rud91j . or Chapter XI of |Yos80j for an exposition of spectral theory of 
unbounded operators. For our purposes, it suffices to consider an extension 
of the spectral theory of bounded operators on Banach spaces discussed in 
§pX3j 
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The resolvent set p(T) of a self-adjoint operator T : Hi — )■ H 2 between two 
complex Hilbert spaces Hi and H 2 is the collection of all complex numbers 
A such that T — XI is a bijective map from T>(T) onto H 2 . The spectrum 
o~(T) of T is the set C \ p{T). It can be showrj^] that the spectrum of a 
self-adjoint operator consists of real numbers. This framework allows us 
to extend Definition 2.14 and consider positive or negative unbounded self- 
adjoint operators. We shall prove a theorem about positive unbounded self- 
adjoint operators in §§2.2.3 



3 See, for example, Theorem 5 in Chapter 31 of Xax02 . 



2.2. TEE COMPLEX INTERPOLATION METEOD 



93 



2.2 The Complex Interpolation Method 

In this section, we study the basics of Alberto Calderon's complex method 
of interpolation, following [Cal64j, |BL76j . and |T aylOb| . Calderon's the- 
ory serves as a turning point for the development of interpolation theory, 
providing a new abstract framework that allows the theory to grow beyond 
the realm of classical harmonic analysis. Despite its prevalence in standard 
expositions of modern interpolation theory, the language of category theory 
is avoided in this section. See § §2.7.3 for a brief sketch of the categorical 
formulation. 



2.2.1 Complex Interpolation 

The main novelty of the theory is the shift in focus from interpolation of 
operators to interpolation of spaces. Take the Fourier transform operator, for 
example. We have used the Riesz-Thorin interpolation theorem (Theorem 



1.43) to the L 1 Fourier transform and the L 2 Fourier transform to obtain a 
new Fourier transform operator on, say, L 15 . It can also be said, however, 
that we have determined L 1,5 to be an "interpolation space" between L 1 and 
L 2 . To make this notion precise, we first pick out the pairs of spaces which 
we can interpolate. 

Definition 2.24. A (complex) Banach couple is an order pair (Bq,Bi) of 
(complex) Banach spaces such that both Aq and A\ are continuously embed- 
ded into a Hausdorff topological vector space V. 



Recall from { 1.3 that we have defined the LP Fourier transform by defining 
the L l + L 2 Fourier transform operator and restricting it onto the "interpola- 
tion spaces" LP . Had L 1 + L 2 not be well-defined, this line of reasoning would 
have been nonsensical. The continuous-embedding criterion guarantees that 



B + Bi is well-defined, as we shall see in Proposition 2.25 below. 



We also recall that Proposition 1.39 provided another way of defining 



the LP Fourier transform: namely, extending via Theorem 1.11 the Fourier 
transform on L 1 R L 2 . This extension, moreover, agreed with the restriction 
of the L 1 + L 2 Fourier transform. 

We would like to model our abstract framework on the two modes of 
interpolation discussed above. This, above all, requires the two spaces B +Bi 
and BqHBi to be well-defined and well-behaved, which we establish promptly. 
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Proposition 2.25. If(B ,Bi) is a Banach couple, then BqHBi is a Banach 
space with the norm 

IMUonBi = max{|H| Bo , \\v\\ Bl }, 
and Bq + B\ is a Banach space with the norm 

IMU0+-B1 = inf IKIUo + IKII-Bi- 

v=vq+vi 

Furthermore, both Bq D B\ and Bq + B\ are continuously embedded into the 
ambient Hausdorff topological vector space. 

Proof. We first show that Bq D B\ is a Banach space. It is easy to check that 
II ■ ||B nBi is a norm. If (v n )™ =1 is a Cauchy sequence in Bq fl Bi, then we 
can find v G B and i>' G £>i such that ||f n — v\\b — > and \\v n — v'\\b 1 — > 
as n — > 00. Norm convergences in 5 and -Bi must agree with convergence 
in the topology of the ambient Hausdorff topological vector space, whence 
the limit must be unique. Therefore, v equals v' and is consequently in 
Bq fl B\. Furthermore, it is now evident that (f n )^Li converges to v in the 
norm topology of B fl B x , thus establishing the completeness of || • HbodBi- 
We now turn to B + B 1 . Clearly, || • ||b +Bi is a norm. If (v n )£!Li is a 
Cauchy sequence in B + B 1 , then we can find sequences (w„)™ =1 and (x n )™ =1 
in B and B±, respectively, such that v n = w n + x n for each n G N. Since 
(w n )^ =1 and (x n )™ =1 are Cauchy sequences in Bq and B±, respectively, we 
can find w E Bq and x G B\ such that ||it>„ — w\\b — > and ||x n — xHbj — )■ 
as n — > 00. It now suffices to observe that 

lim \\v n - (w + x)\\ Bo+Bl < lim \\v n - w n \\ Bo + lim ||t> n - x n || Bl = 0, 

n— ¥00 n— ¥00 n— >oo 

whence £? + B\ is complete. 

We now let V be the ambient space in which Bq and B\ are continuously 
embedded. We can take the embedding Bq fl B\ ^ V to be the restriction 
of either the embedding I?o V or the embedding Bi^FonSonSi. As 
for Bq + I?! we note that both B and B\ can be identified with complete 
and thus closed subsets of V, whence the standard gluing lemma^] of point- 
set topology applied to the embeddings Bq ^ V and B\ <—± V furnishes a 
continuous embedding of Bq + Bi into V . □ 



4 See, for example, Theorem 18.3 in |Mun00j . 
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We now set out to generalize the Riesz-Thorin interpolation theorem in 



the new framework. Let us recall that the proofs of the interpolation theo- 
rems involved placing the two endpoint operators on the two boundaries of 
the strip 

S = {z E C : < Rez < 1} 

and examining what happens in the middle. This was done by encoding 
the operators in a function that is continuous on S, holomorphic in the 
interior of S, and suitably bounded on the two boundaries of the strip, and 
then establishing the intermediate bound via Hadamard's three- lines theorem 



(Theorem 1.44). The key is to leave behind the complex- valued functions on 
S, and to consider instead the Banach-valued functions on S. The same 
argument will then produce new spaces — as opposed to operators — in the 
middle of the strip. 

Definition 2.26. A space-generating functionf^for a complex Banach couple 
(.Bo, Bi) is a function / : S — > B Q + B\ such that 

(a) / is continuous and bounded on S with respect to the norm of B + B\\ 

(b) / is holomorphic in the interior of S, as per the definition of holomor- 
phicity in § pX2| 



(c) / maps into B Q and is continuous with respect to the norm of B on the 
line Re 2; = and decays to zero as | Imz| — > 00 on this line; 

(d) / maps into B\ and is continuous with respect to the norm of B\ on the 
line Rez = 1 and decays to zero as | lmz\ — > 00 on this line. 

The collection of all space-generating functions for (B , B{) is denoted by 
F(B ,B l ). 

The interpolation spaces, which we shall define in due course, will be 
subspaces of B + B\ isomorphic to a quotient space of T{Bq, B\). We first 
check that ^(Bq, Bi) is indeed a well-behaved space of functions. 

Proposition 2.27. ^{Bq^i) is a Banach space, with the norm 



j = max sup \\f(iy)\\ Bo , sup ||/(1 +iy)\\ Bl 

oo<y<oo — oo<y<oo 

3 This is not a standard notion and will not be used beyond this subsection. 
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Proof. It is trivial to check that || • || is a norm, so it suffices to check that 
T(B ,Bi) is complete. To this end, we suppose that (/ n )5£Li is a Cauchy 



sequence in ^(Bq, B\). For each z = x + iy in the strip S, the three-lines 



lemma provides the estimate 



\\fn(z) - f m (z)\\ Bo+Bl < max <^ sup || ||b +Bi, sup 11/(1 + iy)\\ BQ +B 1 

I y y 

— 1 1 fn fm 1 1 T j 

whence (f n )n°=i converges uniformly to a function / G B + B\. By the 
uniformity, the function / is continuous and bounded on S and holomorphic 
in the interior of 5*. 

Since the Bq + B\ norm is determined by the maximum of the Bo norm 
and the B\ norm, we see that (f n )n^=i converges uniformly to a limit go in Bq 
and another limit gi in B\. The Hausdorff condition of the ambient topo- 
logical vector space guarantees that / = g = g%- The uniformity once again 



guarantees that the conditions (c) and (d) in Definition 2.26 are satisfied, 
whence / is in J^(Bq, B\). It now follows from the definition of the J-"-norm 
that || f n — f\\jr — >■ 0, and our claim is established. □ 

We are now ready to give the main definition of the section. 

Definition 2.28. Let (Bq, B\) be a complex Banach couple and fix 9 G [0, 1]. 

The complex interpolation space of order 9 between B and B\ is the normed 
linear subspace 

B e = [Bo, B X \ B = {veB + B 1 :v = f(9) for some / G T(B , B,)} 
of Bo + B\ , with the norm 

Hie = IMk = j n t V\t- 

f(6)=v 



We first check that the interpolation spaces are well-behaved. For this 
purpose, let us recall that the quotient norm on the quoient space B/N of a 
Banach space B is given by 

HHI|s/at= inf \\w\\ B = inf ||u + w||b- 

w£[v] w£N 



It is a standard result that the closedness of N guarantees the completeness 
of the quotient norm, thereby turning B/N into a Banach space. 
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Proposition 2.29. For each 9 G [0,1], the interpolation space [B ,Bi]g is 
isometrically isomorphic to the quotient Banach space J-(Bo,Bi)/Me, where 
Me is the subspace of J-"(Bq, B{) consisting of all functions f G J-(Bq,Bi) 
such that f(9) = 0. 

Proof. First, we observe that Me is a closed subspace of J-"(B , Bi),so that the 
quotient space 3F{Bq, B\) / Mo is a Banach space. We consider the mapping 
/ i — y f{9) from J-(B , B\) to B + B\. Clearly, the image of the mapping is 
[B , Bi] e , and the kernel Me- Since 



11/(0) II Bo+Bi < ma x < sup ||/(iy)||B +Bi, sup ||/(1 + iy)\\ Bo +Bi \ < \\j \\r 
I y y ) 

for each / G jF{Bq,B\), the mapping is bounded, and so J-(B , B-^/Mq is 
isomorphic to [B , Bi]e- □ 

We now observe that the complex interpolation method can be flipped in 
a natural way; the proof is a straightforward application of the definitions 
and is thus omitted. 

Proposition 2.30. For each 9 G [0, 1], we have the isomorphism 

[Bo^B^e = [Si, -B o ]i-0- 

The complex interpolation method behaves well under reiteration. The 
proof is rather elaborate and we omit it: see § §2.7.4 for a discussion. 



Theorem 2.31 (Reiteration theorem). Let (B ,Bi) be a Banach couple. Fix 
6>o, 6>i G [0, 1] and set 

Xj = [B , Bi]e r 

If Bq fl B\ is dense in each of the spaces Bo, B\, and Xq D X\, then we have 
the isomorphism 

[X ,Xi]e — [-Bo jBi] (i_e)0o+00i 

for all 6 G [0, 1] . 

Finally, we show that the method of complex interpolation is truly a 



generalization of the Riesz-Thorin interpolation theorem 



Theorem 2.32 (Complex interpolation is exact). Let (B Q ,Bi) and (Cq,Ci) 
be two complex Banach couples and T : B + B\ — > Co + C\ a bounded linear 
operator. For each 9 G [0, 1], the restriction ofT to [B , Bx]g maps boundedly 
into [Co, Ci]e and satisfies the norm estimate 

IIT^II <^ 1 1 ^7 - ' 1 1 1 — ^ ll / T 7 ll^ 
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Proof. Fix e [0,1], v e [B Q ,Bi] e , and e > 0. We shall show that 

\\Tv\\ Ce <A^A;>|| Be , 

where 

ko = \\T\\b ^Co an< i ^1 = 11^11 Bi->Ci- 

By the definition of the Bq norm, we can find an / G J-(B ,Bx) such that 
f(6)=v and \\f\\ T <\\v\\ Be +e. ' 
We claim that 

g{z) = kt 1 kT'[Tf(z)] 

belongs to J 7 (Co, C\) and satisfies the norm estimate \\g\\r < IMI.Be + £ - The 
continuity of g is clear. Since / G T(Bq,Bi), we have the bound M > 
\\f(z)\\B +Bi f° r ah z E S, and so we have the estimate. 

\\g(z)\\c 0+Cl = K- l k- l z \\Tf(z)\\ Co+ c 1 

< k z Q l k 1 z \\T\\ B{)+Bl ^ Co+Cl \\f(z)\\ Bo+Bl 

< kQ~ 1 ki Z \\T\\ Bo+Bl ^c +c 1 C, 



Therefore, (a) in Definition 2.26 is satisfied. Given any bounded linear func- 
tional I on Co + C\, the map 

lg(z) = l(kt l k^[Tf(z)]) = k z - 1 k- 1 z [lTf(z)] 

is holomorphic in the interior of S, for IT is a bounded linear functional on 
Bq + Bi and / G J-(B , B^. This establishes (b). (c) and (d) follow from 
the rapid decay of k^~ l k^ z , and so g is in J-(Cq, C\). We also observe that 



\g\\ T = max < sup \\g(iy)\\ Co , sup \\g(l + iy)\\ Cl 
I y y 

< max \ sup \k l y \\k~ w \\\f(iy)\\ Bo , sup \k v \\k^ y \\\f(l + iy)\\ Bl 



max <j sup || f(iy) \\ Bq , sup ||/(1 + iy)\\ Bl 
y y 



< \\v\\B a +£, 



as was claimed. 
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It now follows that 

\\v\\b 9 + £> \\g\W > \\9K u )\\b, 
whereby we have the estimate 

\\Tv\\ Cg < ^4(^^+8 
Since e > was arbitrary, we have the inequality 

||Tv||o s < fco~ 9 *iNk 
for all v G Bq, and the proof is now complete. 



□ 



See §2.7.3| for the definition of an exact interpolation space. Calderon 
presents another complex interpolation method in |Cal64j . which leads to a 
study of dual spaces of complex interpolation spaces. See §£ 2.7.4 for a quick 
sketch. 



2.2.2 Interpolation of L p Spaces 

Since we have motivated the method of complex interpolation as a general- 



ization of the |Riesz-Thorin interpolation theorem[ it is natural to expect that 
Riesz-Thorin has been incorporated into the theory as a special case thereof. 
For simplicity's sake, we prove the theorem only on M. d . 

Theorem 2.33. Given po,pi G [l,oo], we have 

[L P0 {R d ),L pl {R d )] e = L PB (R d ), 



for each 9 G (0, 1), where 



Proof. We first show that C 



) is a dense subspace of L po ( 



.]jn 



in 



the L po + L P1 norm given in Proposition 2.25 To see this, we fix an arbitrary 
/ G L P0 + L Pl and find /„ G L Po and f\ G L Pl such that / = /„ + f v By 
Corollary 1.25, we can find two sequences (<p n )^=i an d 
ll/o - fn\\po ->■ and \\f x 



n )^=i such that 



^nWpi — * as n — > oo. Therefore, we have 



lim \\f - ((f n + <Pn)\\LPo+LPi < lim ||/o V 5 " ||po 



'n\\pi 



0. 
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as desired. 

It now suffices to show that 



|[Lfo(M<i),LPi(Rd)] e — || J Upo 

for all / G C™(R d ). To this end, we fix an / G C^°(R d ), pick an e > 0, and 
define 



for each z G S, where 



p(z) 



& \f(x)\ P/ * Z) f(x) 



1 — z z 

+ — 

Po Pi 



We assume without loss of generality that ||/|| Pfl = 1 by renormalizing if 
necessary Observe that $ G F (L Po (R d ) , L P1 (R d )) and < e £ . Since 

$(0) = /, we have and so < ||/|| w . 

To establish the reverse inequality, we recall the following consequence of 
the Riesz representation theorem| 



Pe 



sup 

gSCJ°(K d ) 



fg- 



We fix such a g and set 
for each z G S, where 



Z i_ £d 2\g(x)\p'e/P'^g(x) 



1 



1 - z 



p'(z) p' 
We assume without loss of generality that 
necessary. Observe that 



\9{ x ) 

z 



1 by renormalizing if 



3> fx) 1 ]/. (x) cfe 



satisfies the bounds 



| < e e and + iy)\ < e 



2e 



for all i/gM. It now follows from the three-lines lemma that |H(z)| < e 2e for 
all z G 5, whence ||/|| Pe < ll/llrsi- This completes the proof. □ 



The Riesz-Throin interpolation theorem| now follows as a direct conse- 



quence of Theorem 2.32| and Theorem 2.33 



2.2. THE COMPLEX INTERPOLATION METHOD 



101 



2.2.3 Interpolation of Hilbert Spaces 

We conclude the section by studying another example of interpolation spaces, 
which we shall have an occasion to use later in the chapter. Let if be a 
separable Hilbert space and T a self-adjoint operator on H. We assume 
furthermore that T is positive (see §£2.1.6). The spectral theorem (Theorem 



2.23) implies that there is a unitary operator U : H — )■ L 2 (X,fi) and a 



real- valued //-measurable function a on X such that 

D = UTU~ l u{x) = a(x)u(x) 

for all u G L 2 (X,fi). It then follows that V(T) = U-\V(D)), where 

V(D) = {ue L 2 (X,/2) : au G L 2 (X,/x)}. 

We shall assume that a(x) > 1, which is equivalent to the assumption 
that 

(Tv, v) > \\v\\ 2 . 

Note that if a is bounded, then T>(D) = L 2 (X, /i) and V(T) = H, whence T 
must be a bounded operator (Proposition 2.21). For each 9 in the strip 

S = {z EC:0<Rez<l}, 

we define T e to be the operator \J~ x D d V where 

D e u(x) = a(xfu(x). 

The associated domain T>{T e ) is the preimage U~ l {T>(D e )), where 

V(D 9 ) = {ue L 2 (X,fi) : a e u G L 2 (X,/i)}. 

We now characterize the interpolation spaces between H and T>(T) 

Theorem 2.34. Let H and T be defined as above. For each 9 G [0, 1], we 
have 

[H,V(T)} e = V(T e ). 
Proof. Fix v G V(T e ). If we let 

f(z) = T- z +%, 
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then / e F(H,V(T)) and v = f(6). Conversely, for each / e F(H,V(T)) 
and every e > 0, we note that 

\\T z (I-iET)- l f(z)\\ H < sup max { \\{I - ieT)' 1 ^ z{iy)\\ H , 

y 

UT 1+i y(I + i sT)- 1 z(l + l y)\\ H } 

< c 

by the maximum modulus principle, where C is a constant independent of e. 
It thus follows that f{6) G V(T 9 ), and the proof is complete. □ 



We shall use this result in 592.6.3 when we characterize fractional-order 



Sobolev spaces via the Fourier transform: see the proof of Theorem 2.76 
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2.3 Generalized Functions 

In the following two sections, we set up the stage for the interpolation the- 



orem of C. Fefferman and E. Stein, which we study in £2^3 We introduce 
Laurent Schwartz's theory of distributions in this section and apply it in the 
next section to the study of a classical singular operator called the Hilbert 
transform. 



We recall the remark from §1.3| that the output of the Fourier trans- 
form sometimes cannot be described as a function. To provide a rigorous 
explanation for this remark, we introduce a particular kind of "generalized 
functions", known as tempered distributions. 

2.3.1 The Schwartz Space 

The starting point of distribution theory is that linear functionals acting on 
a space of "nice functions" is easier to deal with than the functions they rep- 
resent. In the context of harmonic analysis, the natural candidate for such 



a space is the |Schwartz space[ which we have seen to be closed under differ- 
entiation and the Fourier transform. Even better, the Schwartz space is also 
closed under other fundamental operations in harmonic analysis: translation, 
rotation, reflection, dilation, and convolution. The first four assertions follow 
from trivial computations, so we omit the proof. 

Proposition 2.35. // ip E y(R d ), then Thf, e^y?, (p, and 5 a tp are in y(R d ) 
for each h G IR and every a > 0. 

To see that the Schwartz space is closed under convolution, we recall 



Theorem 1.41 which states that the Fourier transform turns convolution 
into pointwise multiplication. Since the Schwartz space is closed under the 
Fourier transform, pointwise multiplication, and the inverse Fourier trans- 
form, the convolution theorem implies that the Schwartz space is closed under 
convolution. 

Proposition 2.36. If<p,(f>£ y(R d ), then <p * <p e y(R d ). 

Proof. ip*ct)= (^^0) v = (<£0) v . □ 

Let us now return to the task of examining linear functionals on the 
Schwartz space. We are only interested in the bounded ones, for they are the 
ones that are well-behaved under limiting operations. This, in turn, requires 
us to give a topology on y(M. d ). 
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To do so, we consider the natural Iseminorms 



Pap(<p) 



sup \x a D p <p(x)\ = \\x a D fi (p(x)\\ t 



for each pair of multi-indices a and (3. We observe that each p a/3 is complete, 
in the sense that p a p{Pn — V?m) - > as n, m — > implies that there exists 
a cp G y(R d ) such that p a/ 3(ip n -</?)-)• as n -)■ oo. Indeed, we see by 
setting a = (3 = that (<£ n )5JLi is uniformly Cauchy, whence we can find 
a function ip : IR d — > C to which the sequence converges uniformly. All 
other convergences are uniform as well, and so the decay and smoothness 
conditions are trivially established. 

We now arrange the seminorms into a sequence (p n )™ =1 and consider the 
metric 

y> JL_ ( p n ((f - 
L 4 On 



d{ip,(j>) 



n=l 



2" \l+p n (if 



on ^(IR d ). It is easy to see that d(<fk,<j)) — > as — > oo if and only 
p n (ipk — f) — > for all n. This, in particular, implies that d is a complete 
metric. It therefore follows that y(R. d ) is a Frechet space. The Frechet- 
space topology on y(R. d ) is commonly referred to as the strong topology on 
y(M. ). Consequently, a sequence converging in the strong topology is said 
to converge strongly. 

We now recall that the Fourier transform is a linear automorphism on 
y(M. d ). If (<p n )%Li is a sequence in y(M. d ) converging strongly to ip E ^(IR d ), 
then 



lim \(p n (0 ~ 0{O\ 



lim 

n— >oo 



(<p n (x) - <p(x))e- 2 ™< dx 



< 



lim / \<p n (x) - p{x)\\e- 2mx ^\dx 

n—toc 



lim \(p n {%) ~ v( x ) \ dx = 



by the uniform convergence of (<^ n )^Li- The same convergence result can 
easily be established for all p a p. Carrying out an analogous computation 
for the inverse Fourier transform, we see that the Fourier transform is a 
homeomorphism on y(Mr) onto itself. In other words, the Fourier transform 
is a topological automorphism on y{Mr). 

To wrap up the above discussion, we now collect some basic properties 
of y(M. d ). Of course, the topology on y(R d ) in the following proposition is 
the strong topology. 
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Proposition 2.37. The following are basic properties of the Schwartz space: 

(a) ,y(M. d ) is closed under translation, rotation, reflection, differentiation, 
convolution, and the Fourier transform. 

(b) The Fourier transform is a linear automorphism and a topological auto- 
morphism of y(R d ). 

(c) For each pair of multi-indices a and (3, the map <p(x) >->■ x a D /3 ip(x) is 
continuous. 

(d) IftpE Y(R d ), then r h (p ->■ tp as h ->■ 0. 

(e) Let h — (0, ... , h n , . . . , 0) lie on the nth coordinate axis of M. d . For each 
up G y(R d ), we have 

if- r h (p d 

km — = - — Lp. 

\h\-+o h n dx n 

Proof, (a) and (b) were already established in this subsection, (c) is a triv- 
ial consequence of the /^-convergence implying the p( 7+a ) ^.^-convergence. 
Likewise, (d) and (e) are easy consequences of the definition of strong con- 
vergence in y(EL d ). □ 

It is important to note that strong convergence implies L p convergence. 
In fact, the L p -norm of a Schwartz function is dominated by a finite linear 
combination of Schwartz norms: 

Theorem 2.38. If ip G y(R d ) and p G (0,oo], then, for some constant C p ^ 
depending only on p and d, 

\\D p (p\\ p < C p4 PocM 

|a|<[^±i]+l 

whenever the right-hand side is finite. Here [^] is the greatest integer 
smaller than or equal to 

Proof. The proof is trivial for p = oo, so we assume that p < oo. We let Ud 
denote the volume of the <i-dimensional unit ball and break the L p -norm of 
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D^ip into two pieces: 



\\D^\\ t 



\D^{x)\ p dx + / \D^(x)\ p dx 

\x\<l ~/ |aj|>l 



1/p 



\D^(x)\ p dx + / \x\ d+1 \D^(x)\ p \x\-^ d+l Ux 

\x\<l ~/|aj|>l 



1/p 



:i ^|x|>i / 

< (u> d \\D f) <p{x)\\ p 0O + sup \x\ d+1 \D^<f(x)\ p [ \x\- {d+1) dx 
V xeR d J\x\>i 



i/p 



We set C' d to be the maximum of Ud and f |x| rfrr. For an 



'N>i 



appropriately chosen constant > 0, we have the following estimate: 



\\D^\\ P < (C^ d r[\\D^+ sup \x\ d ^\D^(x)\ p 



i/p 



(PMl P oo) 1/P +(sup \x\ d+1 \D^(x)\ p 



i/p 



(^qJlllD^IU+sup \x\^\D^{x 



< (o Ptd yc; p^iu + sup |x|^i +1 |i>^(x) 

We now set set C' p " d to be the minimum of the map 

x !->■ ^ 

|a|=[^]+l 



on the unit sphere |x| = 1. C pd is positive, for this map has no zero on the 
unit sphere. This, in particular, implies that 



\x 



|a?,+1 <C £ 



\X' 



|a|=[^]+l 
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for all |x| = 1, and we can extend the inequality to all \x\ > by renormal- 
izing x. Since 

PVIIp < {C' p , d yC>; f pMloo + snp \x\^ +1 \D^(x)\) , 



we let 

to conclude that 



^HWxmax{l,Q 

V 



IPVIU + sup \x a \\D P (p{x) 



/ 



/ 



Therefore, the L p -norm of a Schwartz function is dominated by a finite linear 
combination of p^-norms, as was to be shown. □ 



2.3.2 Tempered Distributions 

We are now ready to consider the continuous linear functional on y(R, d ). 

Definition 2.39. The space of tempered distributions is the continuous dual 
space Y'(R d ) of Y(R d ). In other words, ,¥"(R d ) consists of the bounded 
linear functionals on y(R d ). 

In what sense are tempered distributions "generalized functions" ? Given 
a function / e L p (R d ) for some 1 < p < oo, we set 



h(v) = J f{ x )vi x ) dx 



for each (p G y(R d ). Lf is clearly finite and is a linear functional on y(R d ). 
To show that Lf is continuous, it suffices to establish the continuity of Lf 
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at the origin. To this end, we pick a sequence (ip n )^=i hi S^(R d ) converging 
strongly to 0. This, in particular, implies that p a o(Pn) — > as n — > oo, 



whence Theorem 2.38 implies that ||<£> n ||p' — * as n — > oo. It now follows 



from Holder's inequality that 

lim \lf((p n )\ < hm = 0, 

n— >oo n— >oo 

and so l f E y'(R d ). 

At this point, we take a moment to introduce a new notation. Given 
a tempered distribution u and a Schwartz function ip, we shall denote the 
action of u on (p by 

u((p) = ((p,u). 



To see why we use the inner-product notation, we recall that the |Hilbert- 
space F. Riesz representation theorem! yields an isomorphism 



u H> (-,u) n . 

from a Hilbert space "H to its dual "H*. Since each bounded linear functional 
/ on "H corresponds uniquely to an element of "H, we may consider L as an 
element of "H and write 

Iv = (v, L). 

Following this identification, we use the same inner-product notation for the 
Schwartz space and bounded linear functionals thereon. With this notation, 
we identify the L p -function / with the associated bounded linear functional 
Lf on ^(IR^) and write / to denote the tempered distribution. In other 
words, we write (<p,f) to denote Lf((p). 

Let us consider a few more canonical examples of tempered distributions. 
For each / e L 1 (R d ), we can associate a complex Borel measure /z/ defined 
by 

V'fi.E) = / f(x)dx 



E 



for all Borel sets E C R d . In this sense, the space ^#(R ) of complex Borel 
measures on IR d contains L 1 (IR d ). Now, for each /i G ^(M> d ), we consider the 
linear functional 

l M = / <p(x)dfi(x) 



on y(M. d ). To show that / M is bounded, we pick a sequence (v? n )^L 1 of 
Schwartz functions converging strongly to and observe that 

lim |^(0| < lim ||^ n || ly u(M d ) = 
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by Holder's inequality. Similarly as above, we identify the measure fi with 
the associated tempered distribution. 

An important special case is the Dirac 5 -distribution, defined for a fixed 
point x G lR d to be the measure 



8 X {E) 



1 if x G E 
lix^E. 



For each tp G S^(M. d ), the tempered distribution 5 X satisfies the identity 

(<p,5 x ) = <p(x). 

A basic property of the Dirac 5-distribution is that 5° * ip = 4> for all 



Schwartz functions ip. See §£2.3.3 for the definition of convolution of a tem- 
pered distribution and a Schwartz function. The property is a trivial conse- 
quence of the definitions presented in the subsection. Moreover, the Dirac 
^-distributions are "atomic" examples of distributions with point support. 



See §£ 2.7.2 for the precise statement of the theorem, as well as the definition 
of support of a distribution. 

Yet wider classes of functions and measures are tempered distributions. 
We recall that ip : M. d — > C is a Schwartz function if and only if <p satisfies 
the growth condition 

sup (x) n \D l3 (p(x)\ < oo 



for each positive integer n and every multi-index /3, where 

(x) = yl + x 2 . 

Therefore, if a measurable function / : IR d — > C satisfies (x)~ n f(x) G L p (R d ) 
for some postive integer n and 1 < p < oo, then 

<¥>,/> = Jf(xMx)dx = j[(xr n f(x)][(x)Mx)}dx 

is a tempered distribution. In this case, we say that the function / is a 
tempered L p -junction. Similarly, we say that a Borel measure /i, real or 
complex, is a tempered measure if the total variation \y\ of \i satisfies the 
bound 

(x)' n d\fi\(x) < oo 
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for some positive integer n. If // is a tempered measure, then 

{(p,fi) = J (p(x)dfi(x) 

is a tempred distribution. 

In general, a linear functional I on y(R. d ) is a tempered distribution 
precisely in case I is bounded by a finite linear combination of Schwartz 
norms. 

Theorem 2.40. A linear functional I on ,y(M. d ) is a tempered distribution 
if and only if there are integers m and n and a constant C > such that 



\a\<m 
\8\<n 



for allipe y{R d ). 



Proof. It is clear that the existence of such m and n implies the continuity of 
I. Conversely, we suppose that I is continuous. Observe that the collection 
of sets 



\a\<m 
\8\<n 



for all e > and m, n G N and their translates form a subbasis of the strong 
topology on y(R d ). Therefore, we can find e, m, and n such that | (</?, I) \ < 1 



on N £ m n . 



For each ip G y(R d ), we set 

imi = Yl 

\a\<m 
\p\<n 

Fix Eq G (0,e) and note that 

for all if G ,5^(lR d ). Therefore, we have 

I to 01 = raK^OI<i, 
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or 



K</M>l< 




1 



|a|<m 
\P\<n 



It follows that C = e 1 is the desired constant, and the proof is complete. □ 

2.3.3 Operations on Tempered Distributions 

Generalized functions, of course, would be mere abstract nonsense if they 
existed only for the sake of generalization. Before we can study applications 
of tempered distributions, we must extend the familiar operations in analysis 
to this general context. We thus return to the six operations in harmonic 
analysis that we have discussed in the beginning of this section: translation, 
rotation, reflection, dilations, convolution, differentiation, and the Fourier 
transform. 

The guiding principle for the extensions we shall study is that an operation 
on a generalized function manifests itself by operating on the test functions. 
The action of a tempered distribution is studied by "testing it out" on each 
Schwartz function. It is therefore natural to define operations on tempered 
distributions by the action of the operations on Schwartz functions: 

Definition 2.41. Let u be a tempered distribution. 

(a) Given h G R d , the translation of u with respect to h is defined by 



for each ip G y(R d ). 

(b) Given h G R d , the rotation of u with respect to h is defined by 

(<p,e h u) = (e_ h (p,u) 

for each <p G y(R d ). 

(c) The reflection of u is defined by 



(<p, r h u) = (r- h (p, u) 



(ip,u) = (<p,u) 



for each ip G y{R d ). 
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(d) The dilation of u is defined by 

(ip,5 a u) = (a~ d 5 a -np,u) 

for each a > 0. 

(e) Given ip G S^(R d ), the convolution of w and ^ is defined by 

(ip, u * ip) = (xjj * (p, u) 

for each <p G y(R d ). 

(f) Given a multi-index /3, the partial derivative of u with respect to f3 is 
defined by 

(p,-D"u) = (-l)l^(^u) 

for each <p G 

(g) The Fourier transform of u is defined by 

(<p,u) = (<p,u) 

for each ip G «^(K d ). 

The above definitions are direct generalizations of their analogues for 
ordinary functions. If u is an //-function, then 

(cp,u) — / u(t)(p(t) dt = / u(t)(p(t) dt = (<p,u) 



by the multiplication formula (Theorem 1.33). With this new definition of 
Fourier transform, translation, rotation, and reflection are defined in a way 



that makes Proposition 1.29 true. Differentiation of tempered distribution is 
also straightforward, as we have 



((p,D p u) = J[D^u{x)\ip{x)dx 

= (-i)i"i<cV«) 
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via integration by parts, provided that u and tp have enough smoothness 
conditions. Finally, Fubini's theorem shows that 

(ip,u*ip) = J (u * i/))(x)ip(x) dx = J u{x){xp * (p)(x) dx = (xp * (p,u). 

It is easy to see that the space of tempered distributions is closed under 
the six operations. Furthermore, the basic properties of the six operations are 
preserved as well. In the following proposition, we state the generalizations 
of Proposition 1.29, Proposition 1.30 Proposition 1.32, and Theorem 1.35 in 



the context of tempered distributions. The proof of the following proposition 
is a direct adaptation of the corresponding statement on the Schwartz space 
and is thus omitted. 

Proposition 2.42. Let u £ y(R d ), ip G y(W i ), and h G R d . 

(a) iyu = e_ h u. 

(b) e^u = T h u. 

(c) u = u 

(d) If P is a polynomial in d variables, then 

P(D)f(£) = &{P(-2mx)f(x))(Z) and &{P(D)f)(£) = P (2ttz0/(0- 

(e) The inverse Fourier transform, defined by 



for each <p G ,y{R d ), is well-defined on S?'{ 



We have (u) 



u, 



and the Fourier transform is a linear automorphism on 



2.3.4 Convolution Operators and Fourier Multipliers 

The reader may have noted that we have not said anything about the con- 
volution operation. It is an odd one, indeed: for one, convolution is an op- 
eration between a tempered distribution and a Schwartz function, whereas 
all the other operations were those of two tempered distributions. As such, 
the convolution operation possesses a number of special properties we now 
study. 

We first observe that the convolution of a tempered distribution and a 
Schwartz function is not only a tempered distribution, but also a function. 
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Theorem 2.43. If u G y"{R d ) and ip G y{R d ), then u * ip is a map, 
in the sense that the function 

f(x) = (u*if))(x) = (r x i>,u) 

is in e if° c (M. d ). Furthermore, each multi-index (3 admits constants Cp and ng 
such that 

\DP(u*iP)(x)\<Ce{x}^, 
whence u* ip is a tempered distribution. 

Proof. We first show that / G C°°(IR d ). For each 1 < n < d, we write 
h n = (0, . . . , h n , . . . , 0). Proposition 2.37[ e) implies that 



T x+h nljj ~ T x ljj ( dljj 

hm = -t x - — 

\h\-+0 h n yOX n 

in the strong topology. By continuity of u, we have 

hm M = nm ^4^i) = i Tx f|i , , u 

h n -+0 h n h„^0 h n \ \ ox r 

We ca rry o ut an analogous argument for each n and conclude from Propo- 



sition 



2.37 



d) that / G C 1 (lR' i ). Since is a Schwartz function, we can 



reiterate the argument to prove that D@f exists and is continuous for each 
multi-index (3. 

We now show that each D° f satisfies the slow-growth condition for a 
tempered distribution. Theorem |2 .40 furnishes a constant C > and integers 



m and n such that 



\f(x)\ = \(rj,u)\< PapirJ). (2.7) 



|o|<m 
\0\<n 



Since 

p afi (rj) = sup \y a (D^)(y - x)\ = sup \(y + x) a (D^)(y)\ 



is bounded by a polynomial in x, 

(D^f)(x) = (-l)^(r x D^,u) 
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combined with (2.7) establishes the claim. 

It now remains to show that u*ip is the function /. To this end, it suffices 
to prove the identity 



(<p,u*i(>) = J <f(t)f(t)dt. 

To this end, we observe that 

((p, u* ip) = (ip * <f, u) 

ip(x — t)(p(t) dt, u 

(r t fy(x)ip(t) dt,u 

We now note that the Riemann sums of the last integral converge in the 
strong topology, whence 



(T t ip)(x)<p(t)dt,u) = J (T t ip,u),tp(t)dt= J f(t)<p(t)dt. 
The claim follows from the linearity and continuity of u. □ 



We now recall the convolution theorem (Theorem 1.41), which describes 
the close relationship between pointwise multiplication and the convolution 
operation. We shall need a generalization of these theorems in the context 
of distribution theory. To this end, we must make sense of pointwise multi- 
plication between a tempered distribution and a Schwartz function. 

Definition 2.44. If u G y{R d ) and ip G y(R d ), then the pointwise multi- 
plication of u and ip is defined to be 

(cp,uip) = (ip<p,u) 

for each <p G y(R d ). 

The following result is now a trivial consequence of the definition and the 
properties of tempered distributions: 

Theorem 2.45 (Convolution theorem on y"(R d )). If u G y'(R d ) and ip G 
y(R d ), then 

u* ip = uip and uip = u* xjj. 
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A higher, more abstract viewpoint is now in order. Since the convolution 
between a tempered distribution and a Schwartz function produces a func- 
tion, we can consider the convolution operation as an operator on J^(R d ). 
Since ^(R ) is a dense subspace in many function spaces, this operator often 
can be extended via Theorem 11.111 

Definition 2.46. Let V be a normed linear space of measurable functions 
on M. d containing ^(lR d ) as a dense subspace. A convolution operator on V 
is a linear operator on V that can be written as 

Til) = u * tj) 

for all / G y(M, d ), where u is a tempered distribution determined uniquely 
byT. * 

Of course, the most important cases of convolution operators are those 
between Lebesgue spaces. 

Definition 2.47. Given 1 < p, q < oo, the space ^ P}Q is defined to be the 
collection of bounded convolution operators on L p (IR d ) into L q (M> d ). 

Since there is a one-to-one correspondence between each convolution op- 
erator and its associated tempered distribution, we shall abuse the notation 
and speak of tempered distributions in <M PA . 

We shall primarily be concerned with the space ^# P)P . Observe that ^ PiP 
is precisely the collection of bounded operators that are basically the Fourier 
transform: 

Definition 2.48. Fix p G [1, oo). A bounded linear operator T : L p {R d ) — >• 
is an IP Fourier multiplier if there exists a bounded function m such 

that 

Tip = (mif)) v 

for all ip G y(E. d ). m is referred to as the symbol of the Fourier multiplier 
T, and the collection of all symbols of LP Fourier multipliers is denoted by 



It follows from the convolution theorem (Theorem 2.45) that m G if 
and only if the associated Fourier multiplier T m is in ^ PiP . We shall study 
an important example of a Fourier multiplier in the next section. For now, 
we content ourselves with concrete characterizations of two important special 
cases. 
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Theorem 2.49. A convolution operator Tip = u* ip is in if and only 
if u is a complex Borel measure. In this case, 

\\T\\ L i^ L i = \u\, 

the total variation of u. It thus follows that M. 1 is the space of the Fourier 
transforms of complex Borel measures. 

To prove the above theorem, we shall need the following representation 
theorem: 

Lemma 2.50 (Riesz- Markov representation theorem). The dual of the space 
Co(M d ) of continuous functions on M. d vanishing at infinity is isomorphic to 
the space M(K d ) of complex Borel measures on M. d , in the sense that each 
bounded linear functional I on Co(M. d ) furnishes a complex Borel measure fi 
on M. d such that 

Kg) = J g(x) dn{x) 

for each g G C (R d ). 

The proof of the representation theorem can be found in many standard 
textbooks in measure theory: see, for example, Theorem 6.19 in |Rud86] or 
Theorem 7.17 in [Fol99j. 

Let us now return to the task at hand: 

Proof of Theorem \2.49\ If fi is a complex Borel measure, then Young's in- 
equality 

IWI < HIHIi 

continues to hold, whence fi G To prove the converse assertion for an 

arbitrary u G we define the Gauss-Weierstrass kernel 

W(t,e) = (4 7 re)~" /2 e- | * |2/4£ 



for each e > 0: we have used the kernel in Proposition 1.34 and the proof of 
the Fourier inversion formula (Theorem 1.35) already. Note that 

W{t,e) dt = 1 



for all e > 0. Therefore, (W(t,e)) £> o is an approximation to the identity 



and u £ (x) = (u * W(t,e))(x) satisfies the estimate 

|k||i<A||W(t,e)|| 1 = C 
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for some constant C > that does not depend on e. 

We now consider L 1 (]R c( ) as an embedded subspace of the space M(M. d ) of 



complex Borel measures on M. d . By the Riesz-Markov representation theorem 
M(R d ) is the dual of C (M. d ), whence we can endow M(IR d ) with the weak-* 
topology with respect to Co(M d ) (Definition 2.16). The Banach-Alaoglu theo- 



rem (Theorem 2.17) now implies that the unit ball in M(M. d ) is compact, and 
so a "subsequence" (u eh )'%L 1 of the approximation to the identity converges 
to a measure /i E M(M> d ) in this topology. In other words, we have 



lim / ip(x)u £ ,(x) dx — / ip(x)dfi(x) 



(2- 



for each ip E y(R d ). 

It remains to show that fi is the distribution u. To this end, it suffices to 
show that 

(ip,u) = / ip(x) dji{x) 



for each ip E y(R d ). We fix a ip E ,Y(R d ) and set 

ip £ (x) = {ip{t)*W{t,e)){x) = J ij){x-t)W(t,e)dt 
for each e > 0. For each multi-index /3, we have 

(D^ £ )(x) = {D p ijj{t)*W(t,£))(x) = [ (D^)(x-t)W(t,e)dt 



The calculation in the proof of the Fourier inversion theorem (Theorem 1.35) 
shows that D^tp £ converges uniformly to D^ip. Therefore, (ip e )e>o converges 
strongly to ip as e — > 0, and so (ip e ,u) ~ ► (4>, u ) as £ — > 0. Since W(t,e) = 
W(t,e), we have 

(ip e , u) = (W(t, e) * ip(t),u) = (ip,u* W(t, e)) = / ip(x)u £ (x) dx. 



It now follows from (2.8) that 



(ipju) = lim (tp £k ,u) = lim / ip(x)u £k dx 

fc— »oo k— ¥oo 



ip(x) dfi(x) 



as was to be shown. 



□ 
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We also have a nice characterization for 



'2,2- 



Theorem 2.51. A convolution operator Tip = u * ip is in ^#2,2 if and only 
ifue L°°(R d ). In this case, 



00 • 



\\T\\ L 2^ L 2 = \\U\ 

It thus follows that M 2 = L°°(R d ). 

Proof. Let u G ^#2,2- For each ip G J^(R d ), the convolution theorem (Theo- 



rem 



2.45) implies that 

uip — u * 1p, 

for each ip G S / "(R d ), whence by Plancherel's theorem (Theorem 1.37) we 
have 

\\u$^2 = \\u * ^\\2 = \\u * lj)\\2. 
Since u G ^2,2, there exists a constant C > such that 

\\u*il>\\ 2 < C\ 



for all ip G S^(R d ). Applying Plancherel's theorem once more, we obtain the 
norm estimate 

||tt^|| 8 < C7||V»|| 2 = C7||^|| a 

for all i) G y(R d ) 

The Fourier transform is an automorphism on S^(R d ), and so the above 
is equivalent to 

IN>|| 2 <c|M| 2 

for all ip G y(R d ), where 

itip = u{jp y ). 



We now invoke Theorem |1.11| to extend the multiplication operator 

■ip 1 — y u 

as a bounded operator from L? into itself. By the norm-preserving property 
of the extension, it follows that H^Hoo < C 

Conversely, if u G L°°(R d ), then Plancherel's theorem and the convolution 
theorem imply that 

||u*^|| 2 = ||m^ v || 2 = ||w^ V ||2 < Hwllooll^lb = INUIMh 

for each ip G <5?(R d ). It follows that u G A-^2,2, and the operator norm of the 
associated convolution operator is easily seen to be H^Hoo- □ 
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Characterization of the space is trickier. It is a standard result 

in harmonic analysis that coincides with the space of bounded linear 

operators T : L p (R d ) — > L q (M. d ) that commute with translations, viz., 

T(r h f) = r h Tf 

for all / e L p (R d ); see Chapter I, Theorem 3.16 in |SW71j or Theorem 2.5.2 
in |Gra08a] for a proof. The only other known result so far is the duality 
relation 

that holds for p, q £ [l,oo]. A proof can be found in |SW71j . Chapter I, 
Theorem 3.20, or in |Gra08aj . Theorem 2.5.7. 

Fourier multipliers are special cases of pseudodifferential operators, which, 
in turn, are special cases of Fourier integral operators. We shall briefly 



study pseudodifferential operators in §£ 2.6.3 and Fourier integral operators 
in § p^4| 
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2.4 The Hilbert Transform 

In this section, we study a particularly important example of a convolution 
operator, the Hilbert transform. The Hausdorff- Young inequality tells us 
that the Fourier transform is bounded on LP for p G [1,2], but it is not very 
clear how the L p -theory of the Fourier transform should be developed for 
p > 2. To this end, we shall consider a Fourier multiplier whose symbol is a 
constant, suitably normalized to retain the validity of Plancherel's theorem. 
We shall see that this operator is bounded on LP for all p G (1, oo). 



2.4.1 The Distribution pv(^) and the I? Theory 

Recall that a measurable function on a subset E of lR d is locally integrable if 
/ is integrable on every compact subset of E. If a locally integrable function 
/ barely fails to be integrable near the origin, then it is often possible to 
compute the principal value of its integral, defined by 

pv / fix) dx = lim / fix) dx. 

J £ ^°J\x\>e 

Definition 2.52. Let / be measurable on Mr and locally integrable on W 1 \ 
{0}. The principal-value distribution pv(/) is defined to be 

(<P, P v (/)) = lim / f{ x )v( x ) dx 

for each p G J^(R ), provided that the limit exists for all such tp. 

Proposition 2.53. The one- dimensional principal-value distribution pv(-) 
is a well-defined tempered distribution. 

Proof. Fix if G J^(1R) and suppose that e < 1. We write 

(p ®dx= f ^dx+ [ ^dx. (2.9) 



'\x\>£ X Jl>\x\>£ X J\X\>1 X 

Since the Schwartz function ip decays rapidly at infinity, the second integral 



in the right-hand side of (2.9) converges. As for the first one, we note that 

— dx = 0. 

l>[x[>e X 
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Therefore, 



(p(x) 



dx 



1>M>£ 



x 



(p(x) - ip(0) 



dx < 



l>|x|>£ 



X 



sup \(p [X)\ \x 



dx, 



1>M>£ 



x 



and so the integral in the left-hand side of (2.9) converges. The above com- 
putation also yields the estimate 



lim 


( ^ dx 


e^0 J 


'\x\>e X 



\a\<l 
I/3|<1 



for some constant k, whence it follows from Theorem 2.40 that pv(-) is a 
tempered distribution. □ 

Definition 2.54. The Hilbert transform % is the convolution operator 

%ip = — p V [ - J * 1p. 

7T \X J 

As was alluded to above, we now show that the Hilbert transform is a 
Fourier multiplier. 

Theorem 2.55. % is an L 2 Fourier multiplier with the symbol —isgn(x). 
Proof. For each ip e ^(lR d ), we observe that 



V>,pv 



^,pv 



lim 

e-s>0 



lim 

e->0 



l€[>e f 



ip(x)e' 



-2tti^x 



lim 

e^O /i 



lim 

£->0 



\£\>e J -oo S 

00 ^(x)e- 2 ^ x 



>|£|>E"'-00 S 

x /■ ^( X ) e - 2 ^ 



da; d£ 
d£ dx 



lim / ijj(x) 



sin(27r^x) 



d£ I da;. 
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Since 



sin£ 



dx 



< 4 and 



sin(5£) 



dx = it sgn(6) 



for all < a < b < oo, we see that the quantity 

sin(27r£x) 

7>l€l>£ f 



are uniformly bounded by 8 and converges to 7rsgn(x) as e — > 0. We now 
apply the dominated convergence theorem to conclude that 



lim 

£-S>0 



ip(x) 



sin(27r£x) 



cf£ ) dx = n / ip(x)(— isga(x)) dx, 



whence the Fourier transform of pv(^) is — m sgn(^). 
The convolution theorem ( 2.45[ ) now implies that 



7T 



-JF pv 



7T 



*V (0 = -PV - MO = -isgn(0^(0, 



and Plancherel's theorem (Theorem 1.37) yields the equality 

wwh = = ii -isgn&mh = uh = 



It follows that H is an L 2 Fourier multiplier with the symbol — zsgn(£), as 
was to be shown. □ 



2.4.2 The L p Theory 

We now tackle the main theorem of the present section, usually attributed 
to Marcel Riesz. While Riesz himself never studied the problem himself, the 
subject of his 1927 paper |Rie27aj is now known to be directly relevant to the 
study of the Hilbert transform. Riesz's original proof of the related result 
exploits the close relationship between the Hilbert transform and the Cauchy 
integral in complex analysis. A suitably modified version of this proof that 
serves directly as the proof of the L p -boundedness of the Hilbert transform 
can be found in Chapter 2, Section 3 of [SSllj . 

For the sake of brevity, we do not pursue the connections with complex 
analysis. Instead, we follow the approach in §§4.1.3 of |Gra08aj and base our 
proof on the following identity: 
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Lemma 2.56. (Hip) 2 = ip 2 + 2H(ip(H^)) for each if> e y{R d ). 

Proof of lemma. We let = — % sgn(£) be the symbol of the Hilbert trans- 
form. Taking the Fourier transform of the right-hand side, we obtain 

* (£) + 2m(0 * Hr) (£) 



+2m(£) / Tp(r})m(r})ilj(£, — rj) drj 



+2m(0 / ${ri)m(Z - Tjwte - 7]) dri; 



here we have used the convolution theorem (Theorem |1.41[ ). We average the 
last two quantities in the above inequality to conclude that 

J^ 2 + 2WCHV0)KO = J ^-^(l + m^K^lm^-ri)])^. 
Since 

1 + m{£)m{rj) + m(()m(( -77) = 1 - sgn(£) sgn(^) - sgn(£) sgn(£ - 77) 

' if 1] > £ > 

if £ = n > 

1 if£>77>0 

if £ > rj = 
if £ = 

if £ < f] = 
1 if£<77<0 

if £ = 7? < 
if r] < £ < 

if I > 1 or f = 
if I = 1 or rj = 
•1 if I < 1 
m{ri)m{^ — 77), 
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it follows from the convolution theorem (Theorem 2.45) that 



J[i* + mvMr))]l(,) = / i>( V )i>(£ - V )m( V )m(Z - V ) d V 

Applying the inverse Fourier transform on both sides, we obtain the desired 
equality. □ 

We are now ready to establish the L p -boundedness of the Hilbert trans- 
form. 

Theorem 2.57 (M. Riesz). H e ~# PiP for all 1 < p < oo. 



Proof. We have already established that % G ^#2,2 (Theorem 2.55). Fix a 
positive integer n and assume inductively that 



W>\\p < A P \ ]YUP 
for p = 2 n . Lemma [2.56 implies that 

m\\ 2p = \\m)% /2 = u 2 + mwmy 2 , 

and so 



IWII 



2p 



< 



+2\\n(mm\ P ) 



1/2 



< (||^|| P + 2A P ||^(?^)|| P ) 1/2 

< (||^|| p + 2A p ||^|| 2p ||^)|| 2p ) 1/2 ; 

the last inequality follows from the Cauchy-Schwarz inequality. 
It follows that 



> 



/Wll 



2p 



V \W\\2p 

( Why 



9A ( Why 
p V IHI* 



A P +\/l + Al 



2p 



Ap-\/l + Al 
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whence 



A, 



Tm<m±<A. 



2p 



In particular, we conclude that 

\W\\ 2p < (a p + ^Ji + aIj m7p , 

which establishes T-L G ^#2 m .2 m for all positive integers m. We now invoke 



the Riesz-Thorin interpolation theorem (Theorem 1.43), to obtain the L p - 
boundedness for all p > 2. 

It remains to show that T-L G ^t v<p for 1 < p < 2, and this requires a 



standard duality argument. By Plancherel's theorem (Theorem 1.37), we 
have 

W,ip) = W,(p) = (-zsgn(£)^<£) 
= , -i sgn(£)^) = $, H<p) 



with respect to the standard inner product in L 2 ( 
the Riesz representation theorem[ implies that 



This, combined with 



IWII, 



sup 

IMI P '<i 



sup 

IMI P '<1 



sup (ip,H(p); 



sup (Hip, (p) 

IMI P '<i 



„/<i 



here the density of iS (R) in L p ' (R) allows us to use only the Schwartz function 
to compute the operator norm of the functional 

Since p' > 2, we can apply what we have proved above to conclude that 



WHip\\ p < sup (MINIMI? < A>' 

IMI<i 



This completes the proof. 



□ 
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2.4.3 Singular Integral Operators 

One crucial drawback of the theory developed in this section is that the 
Hilbert transform is only defined on M. In order to study multidimensional 
Fourier analysis in this setting, we would expect that it is necessary to define 
higher- dimensional analogues of the Hilbert transform. 

Definition 2.58. Given an integer 1 < j ' < d, we define the nth Riesz 

transform lZ n on W 1 to be the convolution operator 

_ . 1 / x n \ 
where Ud is the volume of the <i-dimensional ball. 



A minor modification of the argument given in the proof of Theorem 2.55 
shows that the Riesz transforms are L 2 Fourier multipliers. How about the 
IP boudedness? To this end, we remark that both the Hilbert transform and 
the Riesz transforms are integral operators of the form 

K{x - y)f(y) dy, 

where the kernel K barely fails to be integrable on the diagonal x = y. The 
transforms, therefore, are instances of singular integral operators, and their 
L p -theory is subsumed to that of a much wider class of operators. We give 
one such example: 

Definition 2.59. K G L 2 (IR d ) is a C alder on- Zygmund kernel if the following 
conditions are met: 

(a) The Fourier transform K of K is in L°°(¥L d ). 

(b) K G C l (M. d \ {0}) and 

\K 



\VK(x)\ < 



OG 



\x 



d+1 " 



Theorem 2.60 (Calderon- Zygmund). Let p G (l,oo). If K is a Calderon- 
Zygmund kernel, then the singular integral operator of Calderon-Zygmund 
type 

T/= f K(x-y)f(y)dy, 
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initially defined for f £ L 1 (IR d ) R L p (WL d ), satisfies the norm estimate 

\\Tf\\ p <A p \\f\\ p 

for some constant A p independent of f and ||if||2- Therefore, T can be ex- 



tended to a bounded operator T : L p (R d ) -»■ L p (R d ) 

The classical proof of the theorem makes use of the technique known as 
Calderon-Zygmund decomposition and can be found in Chapter 2, Section 
2 of |Ste70]. A key element in the proof is the interpolation theorem of 



Marcinkiewicz, which is discussed briefly in §2.7.5| In the present thesis, we 
shall derive this result as a consequence of the Fefferman-Stein interpolation 
theorem, which we take up in the next section. 
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2.5 Hardy Spaces and BMO 

We have seen in the last section that a clever use of the IRiesz-Thorin intcr-l 



polation theorem at intermediate points establishes the LP boundedness "ofj 



the Hilbert transform without the endpoint estimates. This is not entirely 
satisfying, for there is no clear way to generalize the proof to a wider class 
of operators. 

In this section, we take a stroll through a theory of interpolation that 
allows us to interpolate operators that are not necessarily bounded operators 
between Lebesgue spaces. In particular, we shall consider two new Banach 
spaces, H 1 and BMO, which serve as substitutes for L 1 and L°°, respectively. 
The section will culminate in an interpolation theorem between L 2 and BMO 
and its dual theorem between H l and L 2 . 

First presented by Charles Fefferman and Elias Stein in [FS72J, the theory 
of interpolation on H 1 and BMO is laden with intricate technical details and 
requires more than a mere section for a full development. We shall therefore 
confine ourselves to stating the main definition and theorems, with occasional 
sketches of proofs. 



2.5.1 The Hardy Space H 1 

Since the Hilbert transform of an L 1 -function is not necessarily in L 1 , it is 
natural to consider the subspace if 1 (IR) of L 1 (IR) consisting of L ^functions 
whose Hilbert transforms are in L 1 as well. Many important operators, how- 
ever, take functions on a multidimensional Euclidean space as their input, 
and it is thus of interest to consider the d-dimensional generalization H 1 (R d ) 
of iJ 1 (IR), consisting of L 1 -functions whose Riesz transforms are in L 1 as well. 
More generally, we define, for each p > 1, the subspace if p (IR d ) of L p (K d ) as 
follows: 

Definition 2.61. The (real) Hardy space H p (M. d ) of order p consists of func- 
tions / G LP(M. d ) such that the sum 

d 

\\f*Pe\\p + ^2\\{K n f) *p £ \\ p 

71=1 

is bounded for each approximations to the identity (p e )^ =1 . 

By the L p -boundedness of the Riesz transforms, the Hardy space H p (WL d ) 
coincides with the Lebesgue space L p (IR d ) for all 1 < p < oo. As for p = 1, 
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the above discussion implies that the Hardy space H 1 (M. d ) is strictly smaller 
than the Lebesgue space L 1 (M d ). Moreover, we can write if^-functions as 
linear combinations of particularly basic functions in H 1 , known as atoms. 

Definition 2.62. A <i-dimensional H l -atom is a measurable function a sup- 
ported in a ball B in M. d such that |a(a;)| < for almost every x G M d 
and / a(x) dx = 0. 

The atomic decomposition of H 1 can be stated as follows: 

Theorem 2.63 (Atomic decomposition of H l ). Every d- dimensional H 1 - 
atom belongs to ^{W 1 ), and every function f G iJ 1 (IR ci ) can be written as 
the infinite linear combination 

oo 

/ — A n a„ 

n=l 

of H 1 - atoms (a n )^ =1 with l^nl < °°; where the sum is understood as the 
limit of partial sums in L 1 (IR c( ). Therefore, a function f G L 1 (IR d ) is in 
/7 1 (]R c( ) if and only if f admits an atomic decomposition. 

Stein, in jSte93j , proves the equivalence of the Riesz-transform definition 
and the maximal-function definition — which we do not discuss — in Chapter 
III, Section 4.3. The equivalence of the maximal-function definition and 
the atomic-decomposition definition is proved in Chapter III, Section 2.2. 
The L 1 -norm-convergence characterization is taken from Chapter 2, Section 
5.1 in [SSllJ, which takes the atomic-decomposition characterization as the 
definition of H 1 . 

With the above theorem, we can now define the H x -norm of / G H 1 {M! i ) 

as 

oo 

||/||fli=infj^|A B | ) 

n=l 

where the infimum is taken over all atomic decompositions of /. H 1 is a 
Banach space with this norm. Furthermore, H l behaves much nicer than 
L 1 , because the singular integral operators of Calderon-Zygmund type are 
bounded operators from H 1 to L 1 : this is proved in |Ste93j . Chapter III, 
Section 3.1. 

We can, therefore, consider H 1 as a better substitute for L 1 in many 
cases. As is the case with L 1 , the dual of H l can be realized as a concrete 
space of functions. This is the space of bounded mean oscillations, which we 
shall define in due course. 
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2.5.2 Interlude: The Maximal Function 

In order to chasracterize the dual of H 1 , we must find a suitable substitute 
space for L°°. To this end, we shall shift our focus from controlling the 
functions themselves to dealing with their mean values instead. We review 
the basic theory of integral mean values in this section, focusing in particular 
on a dominating function of the mean-value function, the Hardy-Littlewood 
maximal function. 

For each 5 > 0, we recall that the mean value (A$f)(x) of a measurable 
function / at x G M. d is defined by 




The Lebesgue differentiation theorem guarantees that the mean value is a 
good approximation of the actual function. More precisely, if / : M. d — > C is 
locally integrable, then 

Jm(^/)(x) = /(x) 

for almost every x. Proving such a convergence result often requires an esti- 
mate on the size of the operator. For the mean-value operator, we introduce 
the following maximal-estimate operator: 

Definition 2.64. The Hardy-Littlewood maximal function of a measurable 
function / on R. d is 

(Mf)(x) = sup(A 5 \f\)(x) = sup — / \f(y)\dy. 

<5>0 5>0 \-E>S{X)\ Jb s (x) 

Similar to the Hilbert transform and the Riesz transforms, the maximal 
function does not map L 1 into L l . Indeed, if / e L 1 (IR d ) is not of L 1 -norm 
zero, then there exists a ball B in IR d such that J B |/| > 0. We fix some 5 > 
such that B is contained in B s (l) and set 
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For each |x| > 1, we can now establish the following lower bound: 
(Mf)(x) > l — - [ \f(y)\dy 

> u.L, / \f(y)\dy 



m d \B s {i)\ j Bs (i) 
k 



IX 



d • 



It follows that Aif is not integrable on M. d . 

As a substitute for the norm estimate, we have the following inequality: 

Theorem 2.65 (Hardy-Littlewood maximal inequality). Iff G L 1 (]R d ), then 
we have the weak-type estimate 

m{x : (Mf)(x) > a} < ^\\f\U, 

a 

where Ad is a constant that depends only on the dimension d. 

The standard proof of the inequality makes use of the Vitali covering 
lemma and is covered in many standard textbooks in real analysis. See, 
for example, Chapter 3, Theorem 1.1 in [SS05] Theorem 3.17 in [Fol99j . or 
Section 7.5 in |Rud 86j . Even though M. is not of type (1, 1), we have the 
L°°-norm estimate 

IIWIUHI/IU 

and, using these endpoint estimates, we can establish the following interpo- 
lated bounds: 

Theorem 2.66 (L p -boundedness of the maximal function). If f & L p (M. d ) 
for some 1 < p < oo, then Aif e L p (IR d ) and 



II Mf\\ p < A p , d \ u up , 

where A v ^ is a constant that depends only on p and the dimension d. 

The norm estimates are established by splitting M.f into its large and 
small parts: see pages 4-7 in |Ste70j for the proof. The idea of the proof can be 
generalized to establish an interpolation theorem for "weak-type" endpoint 
estimates. This is the theorem of Marcinkiewicz and serves as a starting 



point for the real method of interpolation, which is discussed in ^2.7.5 
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2.5.3 Functions of Bounded Mean Oscillation 

We now consider a space of functions whose integral mean values are con- 
trolled. 

Definition 2.67. The space BMO(lR d ) of bounded mean oscillations on M. d 
consists of locally integrable functions / on IR d such that 

^-J g \f(x)-f B \dx (2.10) 

is uniformly bounded for all balls B, where fs is the integral mean value 

f(x) dx 



1 

\B\ 



B 



over B. 



The infimum of all uniform bounds of (2.10) is denoted by H/Hbmo Pro- 
vided that we consider the quotient space given by the equivalence relation 

/ ~ g f = g + k for some constant k, 

|| • 1 1 bmo is a complete norm on BMO(IR d ). Functions of bounded mean 
oscillation are "nearly bounded" in the following sense: / G BMC^IR^) if and 
only if its sharp maximal function 

fi(x) =Bap -L [ \f(y)-f B \dy 
bbx |-d| Jb 

is bounded. Since the sharp maximal function is dominated by the Hardy- 
Littlewood maximal function at each point, we have the norm estimate 

WfX<\\Mf\\ p <A P Af\\ p 

for all 1 < p < oo. 

We now turn to the remarkable observation of C. Fefferman that there 
is a duality relationship between H 1 and BMO, analogous to that of L 1 and 
L°°. In other words, each bounded linear functional I on if 1 (IR d ) can be 
represented as 

/(/)= / f(x)u(x)dx (2.11) 
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for a unique u G BMO(IR d ). We do not have a substitute for Holder's in- 



equality, however, and so the integral in (2.11) need not be well-defined. 



To bypass the problem, we first consider bounded linear functionals on the 
space H^(M. d ) of bounded, compactly-supported i7 1 -functions with integral 
zero. Hl(M. d ) is precisely the collection of finite linear combinations of H l - 
atoms, which is dense by the argument in Chapter 3, Section 2.4 of |Ste93j . 



The integral in (2.11) then converges and remains the same for all repre- 
sentatives of the equivalence class [u] in BMO. Furthermore, we can invoke 



Theorem 1.11 to extend the linear functionals of the form (2.11) onto H 1 . 



With this, we can now state the fundamental theorem of Charles Feffer- 
man, which is the first main result in |FS72j : 



Theorem 2.68 (C. F efferm an duality). If u E BMO(E 
functional of the form (2.11), initially defined on H^(M, d ) 



d ), then the linear 
and extended onto 



is bounded and satisfies the norm inequality 



< k\\u\\ BM o 

for some constant k. Conversely, every bounded linear functional I on ^(W 1 ) 



can be written in the form (2.11) and satisfies the norm inequality 

IMIbmo < k'\\l\\ 

for some constant k' . 



See Chapter 3, Theorem 1 in |Ste93] for a proof. With the duality theo- 
rem, we can establish a reverse norm estimate 

\\f\\ P <KJf% 

for all 1 < p < oo. A proof of the inequality can be found in Chapter 3, 
Section 2 of |Ste93] . 

2.5.4 Interpolation on H 1 and BMO 

We now come to the main interpolation theorem of the chapter. 

Theorem 2.69 (Fefferman-Stein interpolation, BMO version). For each 9 
in (0, 1), we have 

[L 2 (M d ),BMO(ir% = L Pe (R d ) 
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in the language of complex interpolation, where 

2 

Pe 



1-9 



This generalizes the Stein interpolation theorem in the following sense: 
For each z in the closed strip 



{z G C : < Rez < 1}, 

let us assume that we have a bounded linear operator T z : L 2 (R d ) — > L 2 (R d ) 
such that each 

z^ [ (TJ)g 

is a holomorphic function in the interior of S and is continuous on S and 
that the norms \\T Z \\ L 2^. L 2 of the operators are uniformly bounded. If there 
exists a constant k such that 



m y fh<k\\f\\i 

for all f G L 2 (R d ) and y el and that 

\\Tl+iyf\\BMO < ^ll/lloo 

for all f E L 2 (R. d ) fl L°°(IR d ) and y G M, then we have the interpolated bound 

\\Tef\\ Pe < MI/IU 
for each 9 e (0, 1) and every f G L 2 fl LP , where 



Pe 



9' 



The uniform boundedness hypothesis in the above theorem is for clarity's 



sake can be relaxed to resemble properly the hypothesis of the Stein interpo- 



lation theorem The proof makes uses of the properties of the sharp maximal 



function and can be found in Chapter 3, Section 5.2 of |Ste93j . 

Using the |Fefferman duality theorem we can now modify the argument 
given in the proof of the L p -boundedness of the Hilbert transform| to establish 
the following dual result: 
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Theorem 2.70 (Fefferman-Stein interpolation, H 1 version). For each 9 in 
(0, 1), we have 

[H 1 (R d ),L 2 (R d )} e = L Pe (R d ) 
in the language of complex interpolation, where 

2 



2-9 



This generalizes the Stein interpolation theorem in the following sense: 
For each z in the closed strip 

{z G C : < Rez < 1}, 

let us assume that we have a bounded linear operator T z : L 2 (R d ) — > L 2 (R d ) 
such that each 



z^ [ (TJ)g 



is a holomorphic function in the interior of S and is continuous on S and 
that the norms \\T z \\l2_ ¥L 2 of the operators are uniformly bounded. If there 
exists a constant k such that 

\\T iy fh < k\\f\\l 
for all f e L 2 (R d ) and y eR and that 

||7W||i < k\\f\\ H x 
for all f G L 2 (R d ) D H l (R d ) and y 6 R ; then we have the interpolated bound 

\\T e f\\ pg < kg\\f\\ p9 

for each 9 e (0, 1) and every f G L 2 fl LP , where 

2 



Pe 



2-9' 



The H 1 interpolation theorem can be used to study linear operators that 
are not necessarily of type (1,1). In particular, the Fefferman-Stein the- 
ory settles the L p -boundedness problem of the singular integral operators of 
Calderon-Zygmund type, which include the Riesz transforms. We conclude 
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the section with a remark that the development of the Fefferman-Stein the- 
ory does not make use of the L p -boundedness of the Riesz transforms, lest 
our argument be circular. 

We remark that the L°° — > BMO boundedness hypothesis in Theorem 



2.69 can be replaced by the more general L p — > BMO boundedness hypothesis 



for some p G (l,ooj. The same proof then establishes the corresponding 
interpolation theorem, and the duality argument shows that the H 1 — > L 1 
boundedness hypothesis in Theorem |2 . 70| can be replaced by the more general 
H 1 — > L p hypothesis for some pG [1, oo). We shall have an occasion to use 
this formulation of the if ^-theorem in the next section. 
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2.6 Applications to Differential Equations 

We have developed a number of tools for studying linear operators on function 
spaces throughout the present thesis. In this section, we shall apply them 
to the study of linear partial differential equations and derive a few results. 
Standard theorems that do not make use of interpolation theory will simply 
be cited with references for proofs. 

What are linear partial differential equations? Recall that the higher- 
order derivatives of a function / on Mr can be written as D a f(x), where a is 
an appropriate (^-dimensional multi-index. The symbol D a can be thought of 
as an operator on a suitably defined space of functions on M. d : for example, 
the symbol 

D <i,o,i) = JLJL 

dx\ dx 3 

takes functions in C 2 (M 3 ) to functions in C(M 3 ). We can then define a d- 
dimensional linear differential operator to be an operator L on y'{M d ) that 
takes the form 

N 

(Lu)(x) = Y,Ux)D a -u{x), (2.12) 

71=1 

where each f n is a function on Mr and a n a (^-dimensional multi-index. Since 
u is a tempered distribution, the derivatives are taken to be weak derivatives. 

Given a linear differential operator L, we consider a linear partial differ- 
ential equation 

Lu = C (2.13) 



for some constant C. We say that a linear PDE (2.13) is homogeneous if 
C = 0, and inhomogeneous otherwise. 



2.6.1 Fundamental Solutions 

We begin by examining constant- coefficient linear partial differential equa- 
tions. These are linear partial differential equations such that the coefficients 



f n of the associated differential operator (2.12) are constants. As such, every 



constant-coefficient linear PDE can be written in the form 



P{D)u = v, 



(2.14) 



where P is a polynomial in d variables and v a function on M . 
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We have seen that the Fourier transform turns differentiation into mul- 



tiplication by a polynomial (Proposition 1.30). It is natural to expect that 
every differential equation of the form (2.14) can be solved by taking the 
Fourier transform of both sides, dividing through by the resulting polyno- 
mial factor, and taking the inverse Fourier transform. To make this heuristic 
reasoning precise, we introduce the following notion: 

Definition 2.71. A fundamental solution of a constant-coefficient linear 



PDE ( |2.14[ ) is a tempered distribution E G y"(R d ) such that 

P(D)E = 5°. 



If v is a Schwartz function, then Theorem 2.43| implies that E * v G C c 
Furthermore, 

P(D)(E *v) = (P(D)E) *v = 5°*v = v, 



whence u = E * v is a solution to the constant-coefficient linear PDE (2.14). 
A basic existence result is the following: 

Theorem 2.72 (Malgrange-Ehrenpreis). Every constant- coefficient linear 
PDE has a fundamental solution. 

See Theorem 8.5 in |Rud91j for a proof, which makes use of the rigorous 
version of the heuristic reasoning presented above. 

2.6.2 Regularity of Solutions 

Recall that the (i-dimensional Laplacian is defined to be the differential op- 
erator 

d f) 2 
A = Y—. 

71=1 " 

Laplace's equation is the associated homogeneous PDE 

Am = 0, 

and a solution to Laplace's equation is referred to as a harmonic function. 

It is a standard result in complex analysis that every two-dimensional 
harmonic function is in C 2 (IR d ), as it is the real part of a holomorphic function. 
A higher-dimensional generalization of the harmonic function theory, which 
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can be found in Chapter 2 of |SW71j . establishes the analogous result for 
higher dimensions. Such a result is called a regularity theorem and establishes 
a higher degree of regularity for the solutions of a certain differential equation 
than is a priori assumed. 

To this end, we shall consider a class of differential operators that gener- 
alizes the Laplacian. 

Definition 2.73. Let N e N. A linear differential operator L is elliptic of 



order N if all multi- indices a n in the expression (2.12) satisfy the inequality 



\ot n \ < N and if at least one coefficient function f no with |a no | = N is nonzero. 

An elliptic linear partial differential equation is a differential equation 
whose associated differential operator is elliptic. Examples include Laplace's 
equation and its complex- variables counterpart, the Cauchy-Riemann equa- 
tions. The "once differentiable, forever differentiable" regularity result of 
complex analysis generalizes to elliptic operators: 

Theorem 2.74 (Elliptic regularity, C°°-version) . Let L be an elliptic operator 
of order N with coefficients in C°° . In addition, we assume that all coefficients 
of order N are constants. If v G C oc (M. d ), then every u e §'{R d ) satisfying 
the identity 

Lu = v 

is in C°°(M> d ). This, in particular, implies that every solution u of the homo- 
geneous differential equation 

Lu = 

is in C°°(M. d ). 

The above theorem is a consequence of a more general theorem, which 
will be presented in the next subsection. 



2.6.3 Sobolev Spaces 

So far, we spoke of regularity only in the sense of strong derivatives. Linear 
differential operators were defined in terms of weak derivatives, however, and 
we have yet to discuss the notion of regularity appropriate for this general 
context. Our immediate goal is to make precise the notion of "well-behaved 
weak derivatives". This is done by introducing Sobolev spaces, which are 
subspaces of IP containing functions whose weak derivatives are also in L p . 
We shall then apply Calderon's complex interpolation method to characterize 
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"in-between" smoothness, which we require in order to state the general 



version of the elliptic regularity theorem 



We begin by defining integer-order Sobolev spaces. 

Definition 2.75. Let 1 < p < oo and fceN. The L p -Sobolev space of order 
k on M. d is defined to be the collection W k ' p (M. d ) of tempered distributions u 
such that D a u G L p (R d ) for every multi-index \a\ < k. The derivatives are 
taken to be weak derivatives. 

Note that W 0,p = L p . W k ' p (R d ) is a Banach space, with the norm 

i/p 

\u\\ wk , P = [ E II^IIp 

Ja|<* 

Therefore, we can consider the intermediate spaces 

W s ' p {R d ) = [W k ' p (R d ),W k+1 ' p (R% 
in the sense of complex interpolation, where k < s < k + 1 and 

1 _ (1-0) , 



jfe + 1 



This allows us to speak of "fractional derivatives", in the sense that / G L p 
has derivatives "up to order s" in case / G H /rs ' p (R <i ). 

For p = 2, we can find an alternate, more concrete characterization of L 2 
Sobolev spaces, which are Hilbert subspaces of L 2 . Given a real number s, 
we definejlif s (IR d ) to be the space of tempered distributions u such that 



(o s m g l 2 

where 

(0 = v / TTp 



i d ), 



as was defined in §1.3[ Since the Fourier transform turns differentiation into 
multiplication by a polynomial, the equality H s = W s ' 2 is easily established 
for each positive integer s. We now show that the identification remains true 
for all s > 0. 



6 It is an unfortunate coincidence in the history of mathematical analysis that H p refers 
to the real Hardy space and H s the L 2 -Sobolev space. The convention is to use p for the 
order of a real Hardy space and s for that of an L 2 -Sobolev space. More often than not, 
the context makes it clear which of the two spaces is in use. 
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Theorem 2.76. W s ' 2 (R d ) = H s (R d ) for all s>0. 

Proof. We have already established the theorem for t G N. Fix s > and 
define the multiplication operator 

(M {x yu)(x) = (x) f u(x) 

on H s (R d ). Since (x) s G ( t^ co (R d ) and D a {x) 1 G L°°(R d ) for each multi-index 
a, we see that M( x y maps H t (R d ) into H t (R d ). Furthermor^J the operator 

A* = &~ l M {x) t& 

is an unbounded self-adjoint operator on L 2 (R d ) with X>(A*) = H t (R d ). 
Indeed, u G H l (R d ) is, by definition, equivalent to G L 2 (IR d ), and 

Plancherel's theorem (Theorem 1.37) shows that this happens if and only if 
J^ 1 G L 2 (R d ). It now follows from Theorem that 



[L 2 {R d ),H\R d )} e = V(A te ) = H 



to i 



(2.15) 



for each 9 G [0,1]. The desired result can now be established either by 
invoking Theorem |2.31 or by establishing the identity 



[H to (R d ), H tl (R d )]g = H^ t0+etl {R d ) 



(2.16) 



which follows from a minor but more elaborate modification of the above 
argument. □ 

Note that the space H S (R ) remains well-defined for the negative values 
of s. Formula (2.16) shows that the negative-order L 2 -Sobolev spaces can 
be interpolated in the same manner. Furthermore, the negative-order L 2 - 
Sobolev spaces can be used to establish a representation thereom: indeed, 
for each s > 0, the dual of H s (R d ) is isomorphic to H~ s (R d ). 

We now state a generalization of the elliptic regularity theorem which 
can be applied to a L 2 -Sobolev space of any order: see, for example, Theorem 
8.12 in |Rud 91J for a proof. 

Theorem 2.77 (Elliptic regularity, Sobolev-space verison). Let L be an el- 
liptic operator of order N with coefficients in C°° . In addition, we assume 



7 See 6? 2.1.6 



for notation. 
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that all coefficients of order N are constants. If v E H s (M. d ), then every 
u E S>'(IR d ) satisfying the identity 

Lu = v 

is in C°°(M. d ). This, in particular, implies that every solution u of the homo- 
geneous differential equation 

Lu = 

is m H s+N (R d ). 

The H s - characterization of the L 2 -Sobolev spaces is explained most nat- 
urally in the framework of pseudodifferential operators, which are operators 
f*->Tf given by 

(Tf)(x) = T~ x (a(x,0m)) (x) = [ a(x,0f(0e 2mx< d£. 

a(x,£) is called the symbol of the pseudodifferential operator T, and we often 
write T a to denote T in order to emphasize the symbol of T. If a(x, £) = cii(£) 
is independent of x, then T is a Fourier multiplier operator 

£7(0 = ai(o/(o 



discussed in §£2.3.4 if a(x,£) = a 2 (x) is independent of £, then T is a 
multiplication operator 

(T a f)(x) = a 2 (x)f(x) 

used in the i7 s -characterization of the L 2 -Sobolev spaces. We have seen above 
that the pseudodifferential operators T a with symbols a(x,£) = (£) n mimic 
the behavior of nth-order partial differential operator, and so pseudodiffer- 
ential operators can rightly be seen as generalizations of regular differential 
operators. 

The most commonly used symbol class, denoted by S m , consists of 
Qoor^d x ]R d )_ m aps a(x,£) satisfying the estimate 

i^a(^oi<^(i+ieir Ha| 

for each pair of multi-indices a and /3. In particular, the class S m includes all 
polynomials of degree m. It can be shown that pseudodifferential operators 
with symbols in S m map the Schwartz space ^(M. d ) into itself. Furthermore, 
if a E S mi and b E S m2 , then there is a symbol c E S mi+rrl2 such that 



T c = T a o T b . 
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If m = 0, then the pseudo-differential operators defined on y(§L d ) can be 
extended to a bounded operator from L 2 {R d ) into itself. 

See Chapters VI and VII in [Ste93j for an exposition of pseudodifferential 
operators in the context of the theory of singular integral operators, and 
Chapter 7 in |Tayl0a| for an exposition in the context of partial differential 
equations. Further developments in the theory of Sobolev spaces can be 
found in Chapter 4 in |Tayl0b| and Chapter 13 in |Tayl0c| . 



2.6.4 Analysis of the Homogeneous Wave Equation 

We now turn to the wave equation, an archetypal example of another class of 
differential equations referred to as hyperbolic partial differential equations. 
Unlike Laplace's equation, the wave equation takes two different kinds of 
variable: the one-dimensional time variable t and the (i-dimensional space 
variable x. Therefore, the domain space is (l + <i)-dimensional and we denote 
it by R d+1 . 

Definition 2.78. The D'Alembertian on IR 1+d is the differential operator 

d 2 d d 2 

n=l 

A (linear) wave equation is a differential equation whose associated differen- 
tial operator is the D'Alembertian. 

We shall study the Cauchy problem for the wave equation, which means 
that we will analyze the solutions of the equation with respect to given initial 
conditions at t = or x = 0. To this end, we consider u(t,x) to be an 
y (IR d )-valued function for each fixed t and a C 2 (R d ) -map for each fixed x. 

Theorem 2.79. The Cauchy problem for the homogeneous linear wave equa- 
tion admits the following solution: 

(a) The solution to the homogeneous linear wave equation \3u = with initial 
conditions u(0, 0) = / and u t (0, 0) = is 

u(t,x) = J^" 1 [cos(f !£!)/(£)] (x) = J e 2 ^cos(t|£|)/(0^. (2.17) 
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(b ) The solution to the homogeneous linear wave equation Ou 
conditions u(0, 0) = and u t (0, 0) = g is 



with initial 



u(t, x) 



sin(t|el) 



7(0 



>g sin(t|e|) 



(2.18) 



Combining the above result with Plancherel's theorem (Theorem 1.37), 
we conclude that the operator that takes the initial condition / in (a) and 
produces the solution (2.17) is an L 2 Fourier multiplier with the smooth 
symbol cos(i|£|). The same might be said about the operator that takes the 



initial condition g in (b) and produces the solution (2.18), whose symbol 

sin(^|) 



is integrated in the usual fashion at |£| =0; the result is analytic in |£|. 

Let us take a closer look at this operator. We consider a family of oper- 
ators 



T k g 



sin 



IF) 



for each fceN, which are defined a priori for Schwartz functions g. In this 
family, T\ is the "solution operator" we have discussed above. We shall apply 
the Fefferman- Stein theory to establish a boundedness result for T\. 

If we set k = 0, then is bounded at infinity, and sin(i|£|)(£) 

is smooth at |£| = 0, and so we obtain an L 2 -Fourier multiplier with a smooth 
symbol. How about the other end? If k — d + e for some e > 0, then the 
function 

is in L 1 , and so the operator maps L 1 boundedly to L°°. We can then apply 



the Stein interpolation theorem (Theorem 1.48) to obtain a boundedness 



result for T\. This result is not ideal, however: the index is off by e from the 
optimal index, and so we do not get the L p — > L p bound. 

We are therefore forced to consider the index k = d directly. The integral 



T d g 



sin 



r ) 



^(0 



now does not make sense for an arbitrary L 1 -function g. The function 



(0 1_ 
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only barely fails to be integrable, however, and the operator is actually 
bounded from H 1 to L°° . We now apply H 1 version of the Fefferman-Stein 



interpolation theorem (Theorem 2.70) to obtain the desired boundedness re- 
sult. 

This is a special case of a 1982 theorem of Michael Beals, which is stated 
in terms of Fourier integral operators 



(T x f)(x)= / e^^(x,0f(x)dx. 
Jm.d-i 

Here £ is a rf-dimensional real variable, A a positive parameter, ip a smooth 
function of compact support in x and £, and $ a real- valued smooth function 
in x and £. Furthermore, the Hessian 

is assumed to be nonzero on the support of ip. With these assumtions, we 
have the L 2 -bound 

l|T A /|| 2 < A\-^\\f\\ 2 , 

which subsumes the L 2 -boundedness property of the Fourier transform. 
Fourier integral operators generalize pseudodifferential operators discussed 
in the previous subsection and comprise an important class of integral oper- 
ators known as oscillatory integrals. See Chapters VIII and IX in |Ste93j or 
the second half of [Wol03j for an exposition on oscillatory integrals. 

The theorem of Beals, first stated in his 1980 thesis and subsequently pub- 
lished in |Bea82j . establishes the above boundedness result for every Fourier 
integral operator of the form 

(Tf){x) = J e 2 ™<t-^»a(0f(0d{, 

where a satisfies the growth condition 

i9Voi<Mi+i£ir mH/31 

for each multi-index /3 and ip is homogeneous of degree 1, viz., <p(Xx) = \<p(x) 
and analytic on IR d \ {0}. See Section 1 of [Bea82] for the precise statement 
and the proof. This result can be applied to the study of a wide class of 
hyperbolic partial differential equations: the details can be found in Section 
5 of |Bea82j . 
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2.7 Additional Remarks and Further Results 

In this section, we collect miscellaneous comments that provide further in- 
sights or extension of the material discussed in the chapter. No result in the 
main body of the thesis relies on the material presented herein. 

2.7.1. Let V be a Hausdorff topological vector space, p a seminorm on V, 
and k a positive real number. It can be showrf] that the "closed ball" M = 
{x E X : p(x) < k} satisfies the following properties: 

(a) M contains the zero vector; 

(b) M is convex, viz., v,w G M and < A < 1 implies (1 — X)v + Xw G M; 

(c) M is balanced, viz., v G M and |A| < 1 imply Xv G M; 

(d) M is absorbing, viz., each v G V furnishes a scalar A > such that 
A- 1 ^ G M. 

We say that V is a locally convex (topological vector) space if every open 
set in V containing the zero vector also contains a convex, balanced, and 
absorbing open subset of V. Locally convex spaces behave much like Frechet 
spaces, for the most part, except that they are not required to be metrizable. 
See Chapter 13 of |Lax02j for an exposition of the standard theorems in the 
theory of locally convex spaces. 

2.7.2. The support of a tempered distribution u is defined to be the inter- 
section of all closed sets K C M. d such that (cp, u) = whenever the support 
of if G ^(lR d ) is in IR d \ K. It is easy to see that the Dirac 5-distribution 
5 X ° is supported in the singleton set {xo}. More is true, however: if u is any 
tempered distribution supported in the singleton set {x }, then there exists 
an integer n and complex numbers A a such that 

u = X a D a 5 X0 . 

\a\<n 

In this sense, the Dirac 5-distributions are "atoms" of all distributions sup- 
ported at a point. See Proposition 2.4.1 in |Gra08a] for a proof. 



See, for example, Proposition 2 in Chapter I of [Yos80| . 
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2.7.3. We now place the method of complex interpolation in its fully general 
context, which requires the language of category theory. A category C is a 
collection of objects and morphisms. Contained in C is a class^]Ob C of objects 
and, for each pair of objects A and B, a set Homobc(^4 ; B) of morphisms. 
Furthermore, any three objects A, B, and C furnish a function 

Rom 0hC (B,C) x Homobc(A^) -» Rom 0hC (A,C) 
called a law of composition, satisfying the following properties: 

(i) If A, B, A', and B' are objects of C, then the two sets Homobc(A B) 
and Homobc^', B') are disjoint provided that A ^ A' or B ^ B' . If 
A = A' and B = B', then the two sets must coincide. 

(ii) For each object A of C, there is a morphism id^ G Homobc( y 4 ) A) such 
that (/, id/t) i — y f for all / G Honiobc(A -B) and (id.A,g) | — >■ / for all 
g G Homobc(-B, A), regardless of B G ObC 

(hi) If A, S, C, and -D are objects of C, then / G Homobc(^4> B), g G 
Hom bc(-B, C), and /i G Hom bc(C) D) satisfy the associativity rela- 
tion 

((h,g),f) = (h,(g,f)). 

Since laws of composition behave much like the composition operation be- 
tween two functions, we leave behind the cumbersome notation (/, g) i— )■ h 
and simply write 

/ o g = h or fg = h. 

We shall work in the category T of topological vector spaces, whose ob- 
jects are topological vector spaces (either over M. or C) and whose morphisms 
are continuous linear transformations. A category C\ is a subcategory of an- 
other category C 2 if every object and every morphism in C\ is an object and 
a morphism, respectively, in C 2 - We shall primarily be concerned one sub- 
category of T: namely, the category B of Banach spaces with bounded linear 
maps as morphisms. 

Given two categories C\ and C 2 , we define a covariant functor F : C\ — > C 2 
to be a "map of categories" that sends each object A G ObCi to another 

9 We use the term class here, for many standard collections of objects, such as the 
collection of all vector spaces, do not form sets under the standard axioms of set theory. 
We do not attempt to discuss the theory of classes — or any foundational issue, for that 
matter — in this thesis. 
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object F(A) G ObC 2 and each morphism / G Hom bc(A, B) in C\ to another 
morphism F(f) G Homob c{F( A), F(B)) in C%. Furthermore, a covariant 
function must satisfy the following properties: 



(i) F(fg) = F(f)F(g) for all morphisms / and g in C\ 

(ii) F(id^) = id^(A) for all objects A in C\. 



We now let B denote the category of all Banach couples (Definition 2.24), 
in which a morphism T : (A ,Ai) — > (B ,Bi) is a bounded linear map 
T : Aq + Ai — ?• Bq + i?i such that is a bounded linear map into Bq and 
a bounded linear map into B\. We shall need two covariant functors 
from B to B: the summation functor £ and the intersection functor A. £ 
sends (A , Ax) to A + A x and T : (A , Ax) — >■ (-Bo, Bi) to its direct-sum 
counterpart T : A + A 1 B + B 1 . A sends (A , Ai) to A fl Ax and 
T : (A), Ai) (B , Si) to the restriction TU onAl :i nAi^5 n Bi. 

Given a Banach couple (Aq,Ai), we say that a Banach space A is an 
interpolation space between A and A x if 



(i) A((A , Ai)) continuously embeds into A; 

(ii) A continuously embeds into £((Ao,Al)); 



(iii) Whenever T : (Aq,Ai) — > (Aq, Ai) is a morphism in B, the restriction 
T|a is a bounded linear map into A. 

More generally, two Banach spaces A and B are said to be interpolation 
spaces with respect to Banach couples (A ,Ai) and (B ,Bi) if 



(i) A((A , Ai)) continuously embeds into A; 

(ii) A continuously embeds into £((Ao,Al)); 

(iii) A((Bq, Bi)) continuously embeds into B; 

(iv) B continuously embeds into T,((B , Bi)); 



(v) Whenever T : (A , A\) — > (B ,Bi) is a morphism in B, the restriction 
T\a is a bounded linear map into B. 
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We remark that interpolation spaces A and B with respect to Banach couples 
(Aq, Ax) and (Bq, B{) are not necessarily interpolation spaces between Aq and 
A\ or between Bo and B\, respectively. 

An interpolation functor is a covariant functor F : B — >■ £> such that if 
(y4 , Ai) and (-B , B\) are Banach couples, then F((A , Ai)) and F((B Q , B\)) 
are interpolation spaces with respect to the two Banach couples. Further- 
more, F must send each morphism T : (A Q ,Ai) — > (B ,Bi) in B to its 
direct-sum counterpart T : A + Ai — > B + Bj. An interpolation functor F 
is said to be exact if every pair of interpolation spaces A and B with respect 
to a given pair of Banach couples (A , A\) and (.Bo, -Bi) satisfies the following 
norm estimate 

||71U->b < maxdlTllAo^Bo, HTHa^bJ • 



The Aronszajn-Gagliardo theorem 10 states that every Banach couple (Aq, A{) 
and its interpolation space A furnish an exact interpolation functor F such 
that Fq((A , Ai)) = A. In this sense, we can always find a suitable "interpo- 
lation theorem". 

Given 9 G [0, 1], an interpolation functor F is said to be exact of order 9 
in case 

||^IU-+-B < II^II^AboII^IIai^Bi 

for every pair of interpolation spaces A and B with respect to a given pair 
of Banach couples (A Q) Ai) and (f? ,-Bi). We can now see that the complex 
interpolation functor 

Cg((B ,Bi)) = [£?o,-Bi]<? 

is exact of exponent 9. Calderon introduces another exact interpolation func- 
tor of exponent 9 in |Cal64] . which we shall discuss in the next subsection. 



2.7.4. Here we use the framework established in the preceding subsection: 
see § §2.7.3 for relevant definitions. In this subsection, we shall define an 



exact interpolation functor that serves as a dual to the interpolation functor 
we studied in the main body of the thesis. 
As usual, we let 5" be the closed strip 

S = {z E C : < Rez < 1}. 
10 See Theorem 2.5.1 in |BL76j for a proof. 
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A *- space- generating functiorP^ for a complex Banach couple (B , Bi) is a 
function g : S — >■ Bq + i?i such that 

(a) g is continuous on S with respect to the norm of B + Bi, 

(b) g is holomorphic in the interior of S, as per the definition of holomor- 
phicity in § pX2 



(c) \\g(z) ||b +Bi < k(l + \z\) for some constant k independent of z G S; 

(d) g{iy x ) - g(iy 2 ) G B and g(l + iy x ) - g(iy 2 ) G B x for all y 1 ,y 2 ER and 



|gr||g = max ^ sup 

2/1,2/2 



9(wi) - g(m) 



yi - 2/2 



, sup 

S yi lS/2 



+ -^(l+% 2 ; 



2/i - V2 



is finite. 

We denote by Q(B , Bi) the collection of all *-space-generating functions. 
As was the case with BMO introduced in £2.5, we must take the quotient 
space of G(Bq, B\) modulo the space of constant functions to obtain a Banach 
space ( [Cal64] . §5). Given 9 G [0, 1], the dual complex inteporlation space of 
order 9 between Bq and B\ is defined to be the normed linear subspace 

B e = [B , Bi} = {v G B + B x : v = g\9) for some g G G(B , B x )} 

of Bq + B\, with the norm 

\\ v \\ 9 = \\ v \\b<> = inf \\g\\. 

g>(0)=v 

Here g'{9) is the derivative of g at 9, defined in the usual way as the limit of 
the difference quotient. For each 9 G [0, 1], the interpolation space [B , Bi] e is 
isometrically isomorphic to the quotient Banach space Q(B , Bi)/Af e , where 
Af e is the subspace of Q(B ,Bi) consisting of all functions g G Q(B ,Bi) 
such that g'{9) = ([Cal64], §6). Furthermore, the functor 

C\(B ,B 1 )) = [B ,B 1 ] e 

is an exact interpolation functor of order 9 ( |Cal64j . §7). 



11 In tune with Definition 2.26 this is not a standard term and will not be used beyond 
this subsection. 
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If either B or B\ is reflexive, then 

[B ,B 1 ] 9 = [B ,B 1 ] e and 
for all 9 G (0, 1) ( |Cal64j . §9.5). In general, however, we have the inclusion 

[B ,B 1 ] e C[B ,B 1 ] e 

and the inequality 

\\f\\ e <\\f\\o 

for all 9 G [0, 1]; this is the equivalence theorem ( |Cal64j . §8). Furthermore, 
for each 9 G (0, 1), we have the representation theorem 

([B ,B 1 ] e T = [B* ,B* 1 ] e , 

where the isomorphism is isometric; this is the duality theorem ([Cal64j, 
§12.1). The equivalence theorem and the duality theorem can be used to 



prove the reiteration theorem, which is stated as Theorem 2.31 in the present 
thesis ( |CaI64j . §12.3). 

2.7.5. The standard passage from the Hardy-Littlewood maximal inequality 



(Theorem 2.65) to the L p -boundedness of the Hardy-Littlewood maximal 



function (Theorem 2.66) can be generalized to establish another interpolation 
theorem, due to Jozef Marcinkiewicz. We note the crucial fact that the 
Hardy-Littlewood maximal function is not linear. Indeed, the Marcinkiewicz 
interpolation theorem applies to sublinear operators, which we shall define 
in due course. 

Let (X, n) and (Y, v) be a-finite measure spaces. We fix a vector space D 
of /^-measurable functions on that X that contains the simple functions with 



finite-measure support. Unlike the Riesz-Thorin interpolation theorem, the 



proof of the Marcinkieiwciz interpolation theorem makes use of real- variable 
methods and does not require the functions to be complex-valued. We also 
assume that D is closed under truncation!^! 

We say that an operator T on D into the vector space Ai(Y, //) of im- 
measurable functions on Y is subadditive if, for all /i,/2 G D, we have the 
inequality 

+ / 2 )(^)l < 1(^/0(^)1 + 1(^/ 2 )(^)| 



12 See Definition 1.42 for the definition. 
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at almost every x G Y. Given 1 < p < oo and 1 < q < oo, a sublinear 
operator T : D — > Ai (Y, /i) is said to be of weak type (p, q) if there exists a 
constant k > such that 

\{xeR d :(Tf)(x)>a}\< (M 
The infimum of all such is referred to as the weak (p, q) norm of T. If 



g = oo, then T is of weak type (p, oo) if T is of type (p, oo) (Definition 1.42); 
in this case, the weak (p, oo) norm of T is defined to be the (p, oo) norm of 
T. 

Theorem 2.80 (Marcinkiewicz interpolation). Let 1 < po < qo < oo and 

1 < Pi < Qi < oo and assume that q ^ qi . If T is a subadditive operator 
simultaneously of weak type (po,?o) o/ weak type (pi,gi), iaen T is of 
type (pg, qo) with the norm estimate 

\\T\\ L ve^.L*e < \\T\\wo-+l9o\\T\\zpi-). l <ii 
for each 9 6 [0, 1], where 

p- 1 = (i-e) P ^ + e P 7 1 ; 

q^ 1 = (l-%- 1 + ^ r 1 . 
We remark that the interpolation theorem holds only in the lower triangle 



of the Riesz diagram (Figure 1.4). The interpolation theorem was announced, 
without proof, on the main diagonal pj = qj by Jozef Marcinkiewicz in his 
1939 paper [Mar39j. The extension to the lower triangle was first given by 
Zygmund in | Zyg56] . A proof of the theorem on M d can be found in Appendix 
B of |Ste70j . and a minor modification of this proof yields the theorem as 
stated above. 

The key element in the proof is the rearrangement of the function Tf. 
The theory of rearrangement-invariant spaces encodes the main idea of the 
proof and generalizes the Marcinkiewicz interpolation theorem to a much 
wider class of spaces known as Lorentz spaces. See Chapter V, Section 3 
of |SW71] or §1.4 in |Gra08aj for an introduction to the theory of Lorentz 
spaces and the proof of the generalized Marcinkiewicz interpolation theorem 



in this setting. Analogous to the Riesz-Thorin interpolation theorem the 



Marcinkiewicz interpolation theorem can be generalized in the framework of 
interpolation of spaces as well. This is the method of real interpolation, first 
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developed by Jacques-Louis Lions and Jaak Peetre. A brief introduction to 
the real method of interpolation is given in Chapter 2 of B1.7(ij. For a more 
detailed treatment, any one of the many monographs on the subject of real 
interpolation can be consulted: the classical one is |BS88j . 

2.7.6. In this subsection, we provide a quick survey of the Fourier inversion 
problem. Recall that we have defined the Fourier transform on L p (IR d ) for 
all 1 < p < 2. For p > 2, the LP Fourier transform in general is a tempered 
distribution, so let us restrict our attention to the usual L l Fourier transform 
on L 1 (IR a! ) n L p (R d ). A natural question to ask is as follows: to what extent 
does the identity 

/ = (/) V 

holds for general / £ L p 7 More precisely, if we define 

S R f(x)= [ /(fle^'de, 

we may ask ourselves whether SrJ converge to / as R — > oo, and, if so, in 
what sense. 

If / and / are both in L\R d ), the 



Fourier inversion formula 



tells us 

that Sr/ converges to / pointwise. A strong regularity condition, such as 
continuous differentiability with compact support, will guarantee that the 
convergence will be uniform. 

What can we say about the Fourier transform of general LP functions?. If 
p < 1, then the Lebesgue spaces are pathological, and so the outlook is bleak. 
p = 1 is hopeless as well, for Andrey Kolmogorov exhibited an L 1 function 
whose Fourier inversion formula fails to converge at every point. Chapter 



3, Section 2.2 of [SS05j contains an exmaple for Fourier series. This result 
can be understood as a consequence of the uniform boundedness principle, a 
corollary of the Baire category theorem: 

Theorem 2.81 (Banach-Steinhaus, Uniform boundedness principle). Let V 

be a Banach space and £ a collection of bounded linear functionals on V . If 

sup|/(/)| < oo (2.19) 



for each f £ B, then 



sup ||/|| < oo. 
lec 



The conclusion continues to hold if we assume that (2.19) holds on a subset of 



B that is not a countable union of nowhere dense sets ( "of second category"). 
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The uniform boundedness principle produces many functions whose 



Fourier series diverge on a dense subset of [— tt, 7t\. This divergence result 
for the Fourier series can be morphed into a divergence result for the Fourier 
transform, via the so-called transference principle: see §3.6 in |Gra08aj for a 
discussion. 

How about 1 < p < oo? For n — 1, there is the pointwise almost- 
every where result, established for L 2 functions by Lennart Carleson and ex- 
tended to LP functions for all 1 < p < oo by Richard Hunt two years later: 

Theorem 2.82 (Carleson-Hunt, 1966 & 1968). Fix 1 < p < oo, let f e 

L P (IR), and assume that f exists. Then 



f(x) = lim S R f{x) 

R-^-oo 



almost everywhere. 



The proof is notoriously intricate and is still considered to be one of the 
most difficult proofs in mathematical analysis. As of May 2012, the analogous 
result for higher dimensions is still open. 

How about the LP convergence? For n = 1, we have the classical theorem 



of Marcel Riesz (Theorem 2.57). This result, combined with the uniform 
boundedness principle, implies that the L p convergence does take place for 
all 1 < p < oo. As for n > 1, the L 2 convergence follows from Plancherel's 



theorem (Theorem 1.37) and the uniform boundedness principle. 

It remains to investigate the L p convergence of the Fourier series for 1 < 
p < oo, p 7^ 2. In |Ste56j . E. Stein provides the first progress towards solving 
the problem, as an application of his interpolation theorem. Recall from the 



proof of the Fourier inversion formula that multiplying a nice function — the 



Gaussian in the proof — to / in the integrand facilitated the convergence of 
S R f. Taking a cue from this, we consider the Bochner-Riesz mean 

S s R f(x) = / /(O* 8 "** (l - di 

for each R > and 5 > 0. Note that S R is the regular spherical summation. 
Does S R f converges nicely to / for 5 > 0? Stein obtained a partial result in 



[Stc56j as an application of the Stein interpolation theorem 



Theorem 2.83 (Stein, 1956). S R is a linear operator of type (p,p) for 1 < 
p < 2, provided that 5 > (2/p) — 1. 
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Perhaps surprisingly, the above result did not extend to a proof of the 
L p convergence of multidimensional Fourier transform. Instead, Fefferman 
obtained the following negative result in |Fef71j : 

Theorem 2.84 (Fefferman, 1971). The spherical summation of the Fourier 
inversion formula converges in the norm topology of LP if only if p = 2 viz., 

lim \\f-S R f\\ p = 0. 

for all f G LP whose Fourier transform exist if only if p = 2. 

What can be salvaged from Stein's result? For technical reasons, the 
result does not hold unless 

1 1 

p ~ 2 

This, nevertheless, does not say anything about when the estimate does hold. 
We might hope for the following: 

Conjecture 2.85 (Bochner-Riesz conjecture). If 5 > 0, 1 < p < oo, and 

1 1 
p ~ 2 

then 

hm \\sy-f\\ p = o 

R—Hx> 

for all f EL p . 

This conjecture is tied to many important problems in mathematical anal- 
ysis, among which are the Stein restriction conjecture and the Kakeya con- 
juecture. A quick exposition leading up to these conjectures can be found in 
|Wol03j . For a more detailed survey, see Chapters VIII, IX, and X of |Ste93j 
or Chapter 10 of |Cra08bj . 
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